AN EXTENSION OF QUANTITATIVE NONDIVERGENCE 
AND APPLICATIONS TO DIOPHANTINE EXPONENTS 



Dmitry Kleinbock 



Abstract. We present a sharpening of nondivergence estimates for unipotent (or more generally
polynomial-like) flows on homogeneous spaces. Applied to metric Diophantine approximation,
it yields precise formulas for Diophantine exponents of affine subspaces of $\mathbb{R}^n$ and their
nondegenerate submanifolds.



0. Introduction 

This paper continues the theme started in 1971 by G. A. Margulis [Mr1] when he showed
that trajectories of one-parameter unipotent flows on $\operatorname{SL}_k(\mathbb{R})/\operatorname{SL}_k(\mathbb{Z})$ are never divergent.
This had been earlier conjectured by I. Piatetski-Shapiro, and was used by Margulis for the
proof of the Arithmeticity Theorem for nonuniform lattices. A decade later, S. G. Dani
[Da1-2, Da4] modified the method of Margulis, showing that any unipotent orbit returns to big
compact subsets with high frequency. The latter statement was part of Dani's proof [Da1]
of finiteness of locally finite unipotent-invariant ergodic measures and was used in M. Ratner's
proof [Rt1-2] of Raghunathan's topological conjecture.

The next development came in 1998, when a very general explicit estimate for the above
frequency in terms of the size of compact sets was given in the paper of Margulis and the
author [KM]. In fact it was done in greater generality, namely for a large class of maps
from $\mathbb{R}^d$ into $\operatorname{SL}_k(\mathbb{R})$, which made it possible to derive important applications to metric
Diophantine approximation on manifolds.

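To state some of the results from that paper, recall that the space

$\Omega_k := \operatorname{SL}_k(\mathbb{R})/\operatorname{SL}_k(\mathbb{Z})$

can be identified with the space of unimodular lattices in $\mathbb{R}^k$, and that
$\Omega_k = \bigcup_{\varepsilon > 0} K_\varepsilon$, where the sets

$K_\varepsilon := \{\Lambda \in \Omega_k \mid \|\mathbf{v}\| \ge \varepsilon \text{ for all } \mathbf{v} \in \Lambda \smallsetminus \{0\}\}$

are compact (Mahler's Compactness Criterion; see e.g. [Rg, Corollary 10.9]). Here $\|\cdot\|$ can
be any norm on $\mathbb{R}^k$, which we will assume to be Euclidean, and extend to the space of
discrete subgroups of $\mathbb{R}^k$ by letting $\|\Gamma\|$ be the volume of the quotient space $\Gamma_{\mathbb{R}}/\Gamma$ (here and
hereafter $\Gamma_{\mathbb{R}}$ stands for the $\mathbb{R}$-linear span of $\Gamma$) if $\Gamma \ne \{0\}$, and 1 otherwise.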
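
To make the sets $K_\varepsilon$ concrete, here is a minimal Python sketch for the case $k = 2$ only. It is an illustration, not part of the paper's method: the function names are hypothetical, and the shortest vector is found by classical Lagrange-Gauss reduction, which is specific to rank-two lattices.

```python
import math

def shortest_vector_norm(b1, b2):
    """Euclidean norm of a shortest nonzero vector of the lattice Z*b1 + Z*b2.

    Uses Lagrange-Gauss reduction; b1 and b2 must be linearly independent.
    """
    def dot(u, v):
        return u[0] * v[0] + u[1] * v[1]

    u, v = b1, b2
    if dot(u, u) > dot(v, v):
        u, v = v, u
    while True:
        # Subtract the nearest-integer multiple of u from v.
        m = round(dot(u, v) / dot(u, u))
        v = (v[0] - m * u[0], v[1] - m * u[1])
        if dot(v, v) >= dot(u, u):
            return math.sqrt(dot(u, u))  # (u, v) is now a reduced basis
        u, v = v, u

def in_K(eps, b1, b2):
    """Membership test: every nonzero lattice vector has norm >= eps."""
    return shortest_vector_norm(b1, b2) >= eps

def diag_lattice(t):
    """Basis of diag(e^t, e^-t) Z^2, a unimodular lattice leaving K_eps as t grows."""
    return (math.exp(t), 0.0), (0.0, math.exp(-t))
```

For instance, `diag_lattice(2.0)` has shortest vector of norm $e^{-2} \approx 0.135$, so it lies in $K_{0.1}$ but not in $K_{0.2}$.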

Another notion we need to introduce is that of functions $(C,\alpha)$-good on an open subset
of $\mathbb{R}^d$. We postpone a definition of this property until the next section, noting that, roughly
speaking, it can be interpreted as some kind of polynomial-like behavior.

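The following is a special case of one of the main theorems from [KM]:

Theorem 0.1. Let $d, k \in \mathbb{N}$, $C, \alpha > 0$, $0 < \rho < 1$, and let a ball $B = B(\mathbf{x}_0, r) \subset \mathbb{R}^d$ and
a map $h : \tilde B := B(\mathbf{x}_0, 3^k r) \to \operatorname{SL}_k(\mathbb{R})$ be given. Assume that the following two conditions
hold:

(0.1)  for any subgroup $\Gamma \subset \mathbb{Z}^k$, the function $\mathbf{x} \mapsto \|h(\mathbf{x})\Gamma\|$ is $(C,\alpha)$-good on $\tilde B$,

and

(0.2)  for any subgroup $\Gamma \subset \mathbb{Z}^k$, $\sup_{\mathbf{x} \in B} \|h(\mathbf{x})\Gamma\| \ge \rho$.

Then

(0.3)  $\forall\, 0 < \varepsilon \le \rho \quad \lambda(\{\mathbf{x} \in B \mid h(\mathbf{x})\mathbb{Z}^k \notin K_\varepsilon\}) \le \operatorname{const}(d,k) \cdot C \left(\frac{\varepsilon}{\rho}\right)^{\alpha} \lambda(B)$,

where the constant above is explicitly computable and depends only on $d$ and $k$.
Here and hereafter $\lambda$ stands for Lebesgue measure on $\mathbb{R}^d$.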

Roughly speaking, the informal meaning of this theorem is as follows. In the presence
of condition (0.1), which captures the polynomial-like behavior of the map $h$, one of the
following two alternatives holds: either most of the 'orbit' $h(B)\mathbb{Z}^k$ is contained in $K_\varepsilon$, or
there exists $\Gamma \subset \mathbb{Z}^k$ which is 'responsible for the whole orbit being far away', namely, such
that the covolume of $h(\mathbf{x})\Gamma$ is uniformly small for all $\mathbf{x} \in B$.

Note that in the papers of Margulis and Dani the function $h$ was of the form $h(x) = u_x g$,
where $\{u_x\}$ was a unipotent subgroup of $\operatorname{SL}_k(\mathbb{R})$ and $g$ a fixed element of $\operatorname{SL}_k(\mathbb{R})$. Another
class of important applications of Theorem 0.1 is to Diophantine approximation. There, to
study Diophantine properties of almost every vector of the form $\mathbf{f}(\mathbf{x})$, where $\mathbf{f}$ is a map from
$\mathbb{R}^d$ to $\mathbb{R}^n$, one takes $k = n + 1$ and considers

(0.4)  $h(\mathbf{x}) = \begin{pmatrix} e^{t} & 0 \\ 0 & e^{-t/n} I_n \end{pmatrix} \begin{pmatrix} 1 & \mathbf{f}(\mathbf{x}) \\ 0 & I_n \end{pmatrix}$.

In this case checking (0.1) amounts to establishing polynomial-like behavior of $\mathbf{f}$, and (0.2)
can often be extracted from global Diophantine properties of the image of $\mathbf{f}$. The papers
[KM, BKM, BBKM, G1, K2-3, KLW, KT, KW1-3] contain various number-theoretic
applications of Theorem 0.1 and its generalizations. See also [K1, Mr2] for a survey of this
method and related results.

The purpose of the present paper is to pay special attention to condition (0.2) of Theorem
0.1. Namely, it is not surprising that the higher the rank $\operatorname{rk}(\Gamma)$ of $\Gamma$ (defined as the rank
of $\Gamma$ as a $\mathbb{Z}$-module, or, equivalently, the dimension of $\Gamma_{\mathbb{R}}$), the harder it usually is
to estimate $\|h(\cdot)\Gamma\|$ from below. For example, if $\Gamma$ is of rank $j$ and $h(\mathbf{x})\Gamma$ is generated by
$j$ orthogonal vectors of length $\rho$, one has $\|h(\mathbf{x})\Gamma\| = \rho^{j}$. Indeed, in some cases relevant to
Diophantine applications one can prove estimates of the following form:

(0.2')  for any $\Gamma \subset \mathbb{Z}^k$, $\sup_{\mathbf{x} \in B} \|h(\mathbf{x})\Gamma\| \ge \rho^{\operatorname{rk}(\Gamma)}$.

This leads to a natural question: whether or not such a condition is enough for deriving
(0.3). It became clear to the author in the process of working on the paper [K2] that an
affirmative answer to the latter question would make it possible to significantly generalize
several key results from that paper.



The following theorem provides such an answer: 




Theorem 0.2. Let $d, k \in \mathbb{N}$, $C, \alpha, \rho > 0$, and suppose that a ball $B = B(\mathbf{x}_0, r) \subset \mathbb{R}^d$ and a
map $h : \tilde B := B(\mathbf{x}_0, 3^k r) \to \operatorname{SL}_k(\mathbb{R})$ satisfy (0.1) and (0.2'). Then (0.3) holds.

We will comment on the geometric significance of the improvement of Theorem 0.2 over 
Theorem 0.1 at the end of §2, after a more general result is proved. 

Let us now describe some number-theoretic applications of the above theorem. For $m, n \in \mathbb{N}$,
$M_{m,n}$ will stand for the space of real matrices with $m$ rows and $n$ columns. Define the
Diophantine exponent $\omega(A)$ of $A \in M_{m,n}$ (sometimes called 'the exact order' of $A$) to be
the supremum of $v > 0$ for which there are infinitely many $\mathbf{q} \in \mathbb{Z}^n$ such that

(0.5)  $\|A\mathbf{q} + \mathbf{p}\| < \|\mathbf{q}\|^{-v}$

for some $\mathbf{p} \in \mathbb{Z}^m$. Note that this quantity is independent of the choice of norms on $\mathbb{R}^m$
and $\mathbb{R}^n$ (thus we will repeatedly switch between Euclidean and supremum norms whenever
it is convenient). It is well known and easy to see that one has $n/m \le \omega(A) \le \infty$ for all
$A \in M_{m,n}$, and $\omega(A) = n/m$ for $\lambda$-almost every $A \in M_{m,n}$.
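
As a concrete illustration of (0.5) in the simplest case $m = n = 1$, the following Python sketch (not from the paper; the function names are hypothetical) takes $A = \sqrt{2}$. The continued fraction convergents $p/q$ of $\sqrt{2}$ satisfy $|p^2 - 2q^2| = 1$, hence $|q\sqrt{2} - p| = 1/(q\sqrt{2} + p) < q^{-1}$, so (0.5) with $v = 1$ holds for infinitely many $q$ (with $-p$ in the role of $\mathbf{p}$). This is consistent with the lower bound $\omega(A) \ge n/m = 1$.

```python
import math

def sqrt2_convergents(count):
    """First `count` continued fraction convergents p/q of sqrt(2): 1/1, 3/2, 7/5, ..."""
    p, q = 1, 1
    pairs = []
    for _ in range(count):
        pairs.append((p, q))
        p, q = p + 2 * q, p + q
    return pairs

def satisfies_0_5(p, q, v=1.0):
    """Floating-point check of |q*sqrt(2) - p| < q**(-v), i.e. (0.5) for m = n = 1."""
    return abs(q * math.sqrt(2) - p) < q ** (-v)
```

The exact identity $|p^2 - 2q^2| = 1$ is checked in integer arithmetic; the floating-point test is only reliable for moderately sized $q$.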

We will specialize to the case $m = 1$, that is, consider Diophantine properties of $\mathbf{y} \in \mathbb{R}^n$
interpreted as row vectors (see however §6.3 for the column vector set-up). Further,
our emphasis will be on Diophantine approximation with dependent quantities, where the
dependence is expressed by means of a Borel measure $\mu$ on $\mathbb{R}^n$. Namely, let us define the
Diophantine exponent $\omega(\mu)$ of $\mu$ to be the $\mu$-essential supremum of the function $\omega(\cdot)$; in
other words,

$\omega(\mu) := \sup\{\, v \mid \mu(\{\mathbf{y} \mid \omega(\mathbf{y}) > v\}) > 0 \,\}$.

Clearly it only depends on the measure class of $\mu$. If $\mu$ is naturally associated with a subset
$\mathcal{M}$ of $\mathbb{R}^n$ supporting $\mu$ (for example, if $\mathcal{M}$ is a smooth submanifold of $\mathbb{R}^n$ and $\mu$ is the
measure class of the Riemannian volume on $\mathcal{M}$, equivalently, the pushforward $\mathbf{f}_*\lambda$ of $\lambda$ by any
smooth map $\mathbf{f}$ parametrizing $\mathcal{M}$), we will define the Diophantine exponent $\omega(\mathcal{M})$ of $\mathcal{M}$ to
be equal to that of $\mu$. Clearly $\omega(\mu) \ge n$ for any $\mu$, and $\omega(\lambda) = \omega(\mathbb{R}^n)$ is equal to $n$. The
latter justifies the terminology introduced to Diophantine approximation on manifolds by
V. Sprindžuk: a measure $\mu$ on $\mathbb{R}^n$ (resp., a submanifold $\mathcal{M}$ of $\mathbb{R}^n$) is extremal if $\omega(\mu)$ (resp.,
$\omega(\mathcal{M})$) is equal to $n$, that is, attains the smallest possible value.

It was conjectured by Sprindžuk in 1980 [Sp] and proved in [KM] that real analytic
manifolds not contained in any proper affine subspace of $\mathbb{R}^n$ are extremal. More generally,
let us say that a differentiable map $\mathbf{f} : U \to \mathbb{R}^n$, where $U$ is an open subset of $\mathbb{R}^d$, is
nondegenerate in an affine subspace $\mathcal{L}$ of $\mathbb{R}^n$ at $\mathbf{x} \in U$ if $\mathbf{f}(U) \subset \mathcal{L}$ and the span of all the
partial derivatives of $\mathbf{f}$ at $\mathbf{x}$ up to some order coincides with the linear part of $\mathcal{L}$. If $\mathcal{M}$ is a
$d$-dimensional submanifold of $\mathcal{L}$, we will say that $\mathcal{M}$ is nondegenerate in $\mathcal{L}$ at $\mathbf{y} \in \mathcal{M}$ if any
(equivalently, some) diffeomorphism $\mathbf{f}$ between an open subset $U$ of $\mathbb{R}^d$ and a neighborhood
of $\mathbf{y}$ in $\mathcal{M}$ is nondegenerate in $\mathcal{L}$ at $\mathbf{f}^{-1}(\mathbf{y})$. We will say that $\mathbf{f} : U \to \mathcal{L}$ (resp., $\mathcal{M} \subset \mathcal{L}$)
is nondegenerate in $\mathcal{L}$ if it is nondegenerate in $\mathcal{L}$ at $\lambda$-a.e. point of $U$ (resp., of $\mathcal{M}$, in the
sense of the smooth measure class on $\mathcal{M}$).

The next theorem generalizes some of the results of [KM] and [K2]: 

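Theorem 0.3. Let $\mathcal{L}$ be an affine subspace of $\mathbb{R}^n$, and let $\mathcal{M}$ be a submanifold of $\mathcal{L}$ which
is nondegenerate in $\mathcal{L}$. Then

$\omega(\mathcal{M}) = \omega(\mathcal{L}) = \inf\{\omega(\mathbf{y}) \mid \mathbf{y} \in \mathcal{L}\} = \inf\{\omega(\mathbf{y}) \mid \mathbf{y} \in \mathcal{M}\}$.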

In short, Diophantine exponents of affine subspaces are inherited by their nondegenerate
submanifolds. This was proved in [KM] for $\mathcal{L} = \mathbb{R}^n$, and in [K2] for extremal $\mathcal{L}$ (that is,
with $\omega(\mathcal{L}) = n$). Note that the middle equality is trivially satisfied for $\mathcal{L} = \mathbb{R}^n$, but is not
at all obvious for proper subspaces $\mathcal{L}$. Indeed, it states that the infimum of $\omega|_{\mathcal{L}}$ coincides
with its essential supremum; that is, the existence of a single point $\mathbf{y} \in \mathcal{L}$ with $\omega(\mathbf{y}) \le v$
forces the set $\{\mathbf{y} \in \mathcal{L} \mid \omega(\mathbf{y}) \le v\}$ to have full measure.

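Another natural problem addressed in the paper is computing Diophantine exponents
of affine subspaces in terms of the coefficients of their parametrizing maps. If $\mathcal{L}$ is an
$s$-dimensional affine subspace of $\mathbb{R}^n$, by permuting variables one can without loss of generality
choose a parametrizing map of the form $\mathbf{x} \mapsto (\mathbf{x}, \mathbf{x}A' + \mathbf{a}_0)$, where $A' \in M_{s,n-s}$ and $\mathbf{a}_0 \in \mathbb{R}^{n-s}$
(here both $\mathbf{x}$ and $\mathbf{a}_0$ are row vectors). It will be convenient to denote the matrix
$\begin{pmatrix} \mathbf{a}_0 \\ A' \end{pmatrix}$ by $A \in M_{s+1,n-s}$, so that $\mathcal{L}$ is parametrized by

(0.6)  $\mathbf{x} \mapsto (\mathbf{x}, \tilde{\mathbf{x}}A)$,

where $\tilde{\mathbf{x}}$ stands for $(1, \mathbf{x})$.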

One of the advantages of such a parametrization is the possibility of relating Diophantine
properties of $A$ to those of points of $\mathcal{L}$. Indeed, following [K2], it can be easily shown that a
good approximation to $A$ gives rise to a good approximation to all points of $\mathcal{L}$ simultaneously
(see Lemma 5.4). Consequently, $\omega(A)$ is a lower bound for $\omega(\mathcal{L})$; thus

(0.7≥)  $\omega(\mathcal{L}) \ge \max(\omega(A), n)$.

Estimating $\omega(\mathcal{L})$ from above is a more difficult task. We accomplish it in §5 by writing a
precise expression for $\omega(\mathcal{L})$ parametrized as in (0.6) in terms of $A$ (see Corollary 5.2). In
particular, we prove

Theorem 0.4. If a proper affine subspace $\mathcal{L}$ of $\mathbb{R}^n$ is parametrized as in (0.6), where either
(a) all the columns or (b) all the rows of $A$ are rational multiples of one column (resp., row),
then (0.7≥) turns into equality; that is, one has

(0.7=)  $\omega(\mathcal{L}) = \max(\omega(A), n)$.



This generalizes [K2, Theorem 1.3]. Examples of subspaces described by Theorem 0.4
include: those parallel to coordinate subspaces; lines passing through the origin; subspaces
of codimension one. Whether or not (0.7=) holds in general is an open question; see §6.1-2
for discussion.

We remark that the main results of this paper, Theorems 2.1 and 2.2, are much more general
than Theorem 0.2. Namely, we consider maps from Besicovitch metric spaces equipped
with Federer measures; see §1 for definitions. Thus our main Diophantine result, Theorem
1.3, is substantially more general than Theorem 0.3. In particular, its framework includes
fractal subsets of $\mathbb{R}^n$ or, more generally, measures of the form $\mathbf{f}_*\mu$ where $\mu$ satisfies certain
decay conditions, as in [KLW]. In §1 we review all the necessary terminology and background
facts, and in §2 prove the general quantitative nondivergence estimates. Section 3 describes
a connection between Diophantine approximation and dynamics which makes it possible to
apply Theorem 2.2 to Diophantine exponents. Theorem 1.3 is proved in §4 and Theorem
0.4 in §5. In fact, the Diophantine exponents of arbitrary affine subspaces are expressed
in terms of so-called higher order Diophantine exponents of matrices, which are introduced
and studied in detail in §5. The last section contains several open questions and further
generalizations of the Diophantine problems considered in the paper.

Acknowledgements: The author is thankful to Gregory Margulis, Barak Weiss and the 
reviewer for their valuable comments. This work was supported in part by NSF Grant 
DMS-0239463. 
1. Preliminaries (Besicovitch, Federer, good, nonplanar)
and the main Diophantine results



A metric space $X$ is called $N$-Besicovitch if for any bounded subset $A$ of $X$ and any
family $\mathcal{B}$ of nonempty open balls in $X$ such that each $x \in A$ is a center of some ball of $\mathcal{B}$,
there is a finite or countable subfamily $\{B_i\}$ of $\mathcal{B}$ covering $A$ with multiplicity at most $N$.
We will say that $X$ is Besicovitch if it is $N$-Besicovitch for some $N$. The fact that $\mathbb{R}^d$ is
Besicovitch is the content of Besicovitch's Covering Theorem [Mt, Theorem 2.7].

Let $\mu$ be a Radon measure on $X$, and $U$ an open subset of $X$ with $\mu(U) > 0$. Following
[KLW], let us say that $\mu$ is $D$-Federer on $U$ if

(1.1)  $\sup_{x \in \operatorname{supp}\mu,\ r > 0,\ B(x,3r) \subset U} \frac{\mu(B(x,3r))}{\mu(B(x,r))} < D$.

Equivalently, one can replace '3' in (1.1) by any $c > 1$, appropriately changing the value
of $D$. This explains why Federer measures are often called doubling. It will be useful to
have a nonuniform version of the above definition: we will say that $\mu$ as above is Federer if
for $\mu$-a.e. $x \in X$ there exists a neighborhood $U$ of $x$ and $D > 0$ such that $\mu$ is $D$-Federer
on $U$. Many natural examples of measures, including those supported on fractals, can be
shown to be Federer.

For a subset $B$ of $X$ and a function $f$ from $X$ to a normed space with norm $\|\cdot\|$, we let
$\|f\|_B := \sup_{x \in B} \|f(x)\|$. If $\mu$ is a measure on $X$ and $B$ is a subset of $X$ with $\mu(B) > 0$, we
define $\|f\|_{\mu,B}$ to be equal to $\|f\|_{B \cap \operatorname{supp}\mu}$.

A function $f : X \to \mathbb{R}$ is called $(C,\alpha)$-good on $U \subset X$ with respect to $\mu$ if for any open
ball $B \subset U$ centered in $\operatorname{supp}\mu$ one has

(1.2)  $\forall\, \varepsilon > 0 \quad \mu(\{x \in B \mid |f(x)| < \varepsilon\}) \le C \left(\frac{\varepsilon}{\|f\|_{\mu,B}}\right)^{\alpha} \mu(B)$.

Informally speaking, a function is good if the set of points where it takes small values has
small measure. We refer the reader to [KM, BKM, KLW, KT] for various properties and
examples. One of the elementary observations is conveniently stated below:

Lemma 1.1 [KLW, Lemma 4.1]. Suppose that $f_1, \dots, f_k$ are $(C,\alpha)$-good on $U$ with respect
to $\mu$. Then $(f_1^2 + \dots + f_k^2)^{1/2}$ is $(k^{\alpha/2}C, \alpha)$-good on $U$ with respect to $\mu$.

In the situations when $U$ is a subset of $\mathbb{R}^d$ and $\mu = \lambda$, we will omit the reference to the
measure and will simply say '$f$ is $(C,\alpha)$-good on $U$', as has already been done in (0.1). In
that case one can replace $\|f\|_{\mu,B}$ in (1.2) by $\|f\|_B$ and not pay attention to the restriction
of the center of $B$ lying in the support of the measure.

$(C,\alpha)$-good functions often come in families. For example, condition (0.1) used in Theorems
0.1 and 0.2 asserts that functions of the form $\mathbf{x} \mapsto \|h(\mathbf{x})\Gamma\|$, where $\Gamma$ runs through
subgroups of $\mathbb{Z}^k$, are all $(C,\alpha)$-good with uniform $C$ and $\alpha$. Often we will need to check
the $(C,\alpha)$-good property for functions from a given finite-dimensional function space. An
example: polynomials in $d$ variables are $(C,\alpha)$-good on $\mathbb{R}^d$ with $C$ and $\alpha$ depending only
on $d$ and the degree of the polynomial.

The following definition was introduced in [K2] and [KT]. Let $\mathbf{f} = (f_1, \dots, f_n)$ be a map
from a metric space $X$ to $\mathbb{R}^n$ and $\mu$ a measure on $X$. We will say that $(\mathbf{f}, \mu)$ is good at $x \in X$
if there exists a neighborhood $V$ of $x$ and positive $C, \alpha$ such that any linear combination of
$1, f_1, \dots, f_n$ is $(C,\alpha)$-good on $V$ with respect to $\mu$. We will simply say that $(\mathbf{f}, \mu)$ is good
if it is good at $\mu$-almost every point. Again, the reference to the measure will be omitted
when $\mu = \lambda$, in which case we will say that $\mathbf{f}$ is good or good at $x$. For example,
polynomial maps are good (in fact, good at every point). More generally, based on the
work done in [KM], the following was proved in [K2]:

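Lemma 1.2. Let $\mathcal{L}$ be an affine subspace of $\mathbb{R}^n$ and let $\mathbf{f}$ be a smooth map from open
$U \subset \mathbb{R}^d$ to $\mathcal{L}$ which is nondegenerate in $\mathcal{L}$ at $\mathbf{x} \in U$. Then $\mathbf{f}$ is good at $\mathbf{x}$.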
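
As a sanity check on the definition of a $(C,\alpha)$-good function, the following Python sketch tests inequality (1.2) for the single function $f(x) = x$ on $\mathbb{R}$ with Lebesgue measure, where balls are intervals $(a, b)$. The constants $C = 2$, $\alpha = 1$ are an assumption made for this illustration (they can be verified by hand for this one function) and are not the constants used in [KM]; the function names are hypothetical.

```python
import random

def sublevel_measure(a, b, eps):
    """Lebesgue measure of {x in (a, b) : |x| < eps}."""
    lo, hi = max(a, -eps), min(b, eps)
    return max(0.0, hi - lo)

def good_bound(a, b, eps, C=2.0, alpha=1.0):
    """Right-hand side of (1.2) for f(x) = x on the interval (a, b)."""
    sup_f = max(abs(a), abs(b))  # sup of |f| over the interval
    return C * (eps / sup_f) ** alpha * (b - a)

def check_good(trials=10000, seed=0):
    """Test (1.2) on random intervals and random eps; True if no violation is found."""
    rng = random.Random(seed)
    for _ in range(trials):
        a = rng.uniform(-5.0, 5.0)
        b = a + rng.uniform(1e-3, 5.0)
        eps = rng.uniform(1e-4, 10.0)
        if sublevel_measure(a, b, eps) > good_bound(a, b, eps) + 1e-12:
            return False
    return True
```

A random search of this kind can only fail to find a counterexample; it does not replace the proofs in the cited references.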

For a subset $M$ of $\mathbb{R}^n$, define its affine span $\langle M \rangle_a$ to be the intersection of all affine
subspaces of $\mathbb{R}^n$ containing $M$. Then it is easy to see that $\mathcal{L}$ in the above lemma is equal
to $\langle \mathbf{f}(B) \rangle_a$ for some open $B \ni \mathbf{x}$. It will be useful to define a similar property for more
general maps and measures. Namely, let $X$ be a metric space, $\mu$ a measure on $X$, $\mathcal{L}$ an
affine subspace of $\mathbb{R}^n$ and $\mathbf{f}$ a map from $X$ into $\mathcal{L}$. Say that $(\mathbf{f}, \mu)$ is nonplanar in $\mathcal{L}$ (cf.
[KT, §4] and [KW3, §1]) if

(1.3)  $\mathcal{L} = \langle \mathbf{f}(B \cap \operatorname{supp}\mu) \rangle_a$  for every nonempty open $B$ with $\mu(B) > 0$.

As before, we will omit the dependence on $\mu$ (resp., $\mathcal{L}$) when $\mu = \lambda$ (resp., $\mathcal{L} = \mathbb{R}^n$).
Clearly $(\mathbf{f}, \mu)$ is nonplanar iff for any nonempty open $B$ of positive measure the restrictions
of $1, f_1, \dots, f_n$ to $B \cap \operatorname{supp}\mu$ are linearly independent over $\mathbb{R}$.

As was said before, a basic example is given by nondegenerate smooth maps from $\mathbb{R}^d$ to
$\mathbb{R}^n$: it is clear from the definition that nondegeneracy in $\mathcal{L}$ implies nonplanarity in $\mathcal{L}$. Thus
the following statement generalizes Theorem 0.3:

Theorem 1.3. Let $\mu$ be a Federer measure on a Besicovitch metric space $X$, $\mathcal{L}$ an affine
subspace of $\mathbb{R}^n$, and let $\mathbf{f} : X \to \mathcal{L}$ be a continuous map such that $(\mathbf{f}, \mu)$ is good and nonplanar
in $\mathcal{L}$. Then

$\omega(\mathbf{f}_*\mu) = \omega(\mathcal{L}) = \inf\{\omega(\mathbf{y}) \mid \mathbf{y} \in \mathcal{L}\} = \inf\{\omega(\mathbf{f}(x)) \mid x \in \operatorname{supp}\mu\}$.

The special case $\mathcal{L} = \mathbb{R}^n$ was (in a slightly different terminology) one of the main results
of [KLW]. Note that in all the applications considered in this paper we will take $X$ to be
an open subset of $\mathbb{R}^d$; however one can also work with vector spaces over other local fields
and, using methods from [KT] and [G2], obtain non-Archimedean versions of many results
from the present paper; see §6.6 for further discussion.

Many nontrivial examples of measures $\mu$ and maps $\mathbf{f}$ satisfying the conditions of the above
theorem can be found in the paper [KLW]. For example, it is not hard to see that a measure $\mu$
on $\mathbb{R}^n$ is friendly (a property introduced in [KLW]) iff it is Federer and $(\mathrm{Id}, \mu)$ is good and
nonplanar (here $\mathrm{Id}$ is the identity map $\mathbb{R}^n \to \mathbb{R}^n$). Many measures naturally arising from
geometric constructions can be shown to possess an even stronger property; such measures
were referred to as 'absolutely decaying and Federer' in [KLW] and as 'absolutely friendly'
in [PV]; many examples of those can be found in [KLW, U, SU]. It was proved in [KLW, §7]
that if $\mu$ is absolutely decaying and Federer and $\mathbf{f}$ is nondegenerate at $\mu$-a.e. point of $\mathbb{R}^d$,
then $(\mathbf{f}, \mu)$ is good and nonplanar. From the aforementioned facts, using Theorem 1.3 and,
if necessary, compositions with affine isomorphisms, the following can be deduced:

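Corollary 1.4. (a) Let $\mathcal{L}$ be a $d$-dimensional affine subspace of $\mathbb{R}^n$, let $\mu$ be a friendly
measure on $\mathbb{R}^d$, and let $\mathbf{f} : \mathbb{R}^d \to \mathcal{L}$ be an affine isomorphism. Then $\omega(\mathbf{f}_*\mu) = \omega(\mathcal{L})$.

(b) Let $\mu$ be an absolutely decaying and Federer measure on $\mathbb{R}^d$, $\mathcal{L}$ an affine subspace of
$\mathbb{R}^n$, and let $\mathbf{f} : \mathbb{R}^d \to \mathcal{L}$ be a smooth map which is nondegenerate in $\mathcal{L}$ at $\mu$-a.e. point of $\mathbb{R}^d$.
Then $\omega(\mathbf{f}_*\mu) = \omega(\mathcal{L})$.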

For the special case of $\mathcal{L}$ being extremal, part (a) was stated without proof in [KLW,
§10.5].
2. Weighted posets and quantitative nondivergence

In this section we work with mappings of weighted partially ordered sets (posets) into
spaces of functions on balls in a Besicovitch metric space. Here is some relevant terminology.
By a weighted poset we mean a partially ordered set $\mathfrak{P}$ together with a map $\eta : \mathfrak{P} \to \mathbb{R}_+$.
A linearly ordered subset of $\mathfrak{P}$ will be called a flag. We will denote by $\ell(\mathfrak{P})$ the length of $\mathfrak{P}$,
i.e. the cardinality of a flag with maximal cardinality. If $\mathfrak{F}$ is a subset of $\mathfrak{P}$, we let $\mathfrak{P}(\mathfrak{F})$ be
the poset of elements of $\mathfrak{P} \smallsetminus \mathfrak{F}$ comparable with every element of $\mathfrak{F}$. Note that one always
has

(2.1)  $\ell(\mathfrak{P}(\mathfrak{F})) \le \ell(\mathfrak{P}) - \ell(\mathfrak{F})$.
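
The notions of flag, length and $\mathfrak{P}(\mathfrak{F})$ can be tried out on a small example. The Python sketch below is an illustration only, with hypothetical function names: it takes the poset of nonempty subsets of a three-element set ordered by inclusion, computes lengths by brute force, and checks inequality (2.1) for every flag $\mathfrak{F}$, in which case $\ell(\mathfrak{F})$ is just the cardinality of $\mathfrak{F}$.

```python
from itertools import combinations

def comparable(s, t):
    """Two subsets are comparable if one contains the other."""
    return s <= t or t <= s

def is_flag(elements):
    """A flag is a linearly ordered subset: all pairs are comparable."""
    return all(comparable(s, t) for s, t in combinations(elements, 2))

def length(poset):
    """Cardinality of a largest flag, found by brute force (small posets only)."""
    poset = list(poset)
    best = 0
    for size in range(1, len(poset) + 1):
        if any(is_flag(c) for c in combinations(poset, size)):
            best = size
        else:
            break  # no flag of this size, hence none of any larger size
    return best

def relative_poset(poset, flag):
    """P(F): elements outside the flag that are comparable with every element of it."""
    return [s for s in poset
            if s not in flag and all(comparable(s, t) for t in flag)]

def check_2_1(poset):
    """Verify length(P(F)) <= length(P) - |F| for every flag F in the poset."""
    poset = list(poset)
    total = length(poset)
    for size in range(len(poset) + 1):
        for flag in combinations(poset, size):
            if is_flag(flag) and length(relative_poset(poset, flag)) > total - len(flag):
                return False
    return True

# Nonempty subsets of {0, 1, 2} ordered by inclusion.
P = [frozenset(c) for r in range(1, 4) for c in combinations((0, 1, 2), r)]
```

Here `length(P)` is 3, realized for example by the flag $\{0\} \subset \{0,1\} \subset \{0,1,2\}$.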

We will fix a metric space $X$ and consider a weighted poset $(\mathfrak{P}, \eta)$ together with a mapping
$\psi$ from $\mathfrak{P}$ to the space $C(B)$ of $\mathbb{R}$-valued continuous functions on some subset $B$ of $X$, which
we will denote by $s \mapsto \psi_s$. Given such a mapping and a positive number $\varepsilon$, we will say that
a point $z \in B$ is $\varepsilon$-marked relative to $\mathfrak{P}$ if there exists a flag $\mathfrak{F}_z \subset \mathfrak{P}$ such that

(M1)  $\varepsilon\eta(s) \le |\psi_s(z)| \le \eta(s) \quad \forall\, s \in \mathfrak{F}_z$;

(M2)  $|\psi_s(z)| \ge \eta(s) \quad \forall\, s \in \mathfrak{P}(\mathfrak{F}_z)$.

We will denote the set of all such points by $\Phi(\varepsilon, \mathfrak{P})$. When it does not cause confusion, we
will omit the reference to either $\mathfrak{P}$ or $\varepsilon$, and will simply say that $z$ is $\varepsilon$-marked, or marked
relative to $\mathfrak{P}$.

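Theorem 2.1. Let $k, N \in \mathbb{Z}_+$ and $C, \alpha, D > 0$. Suppose that we are given an $N$-Besicovitch
metric space $X$, a weighted poset $(\mathfrak{P}, \eta)$, a ball $B = B(x, r)$ in $X$, a measure $\mu$ which is
$D$-Federer on $\tilde B := B(x, 3^k r)$, and a mapping $\psi : \mathfrak{P} \to C(\tilde B)$, $s \mapsto \psi_s$, such that the
following holds:

(A0)  $\ell(\mathfrak{P}) \le k$;

(A1)  $\forall\, s \in \mathfrak{P}$, $\psi_s$ is $(C,\alpha)$-good on $\tilde B$ with respect to $\mu$;

(A2)  $\forall\, s \in \mathfrak{P}$, $\|\psi_s\|_{\mu,B} \ge \eta(s)$;

(A3)  $\forall\, y \in \tilde B \cap \operatorname{supp}\mu$, $\#\{s \in \mathfrak{P} \mid |\psi_s(y)| < \eta(s)\} < \infty$.

Then $\forall\, \varepsilon > 0$ one has

$\mu(B \smallsetminus \Phi(\varepsilon, \mathfrak{P})) \le kC(ND^2)^k \varepsilon^\alpha \mu(B)$.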

The proof given below is a weighted modification of the argument from [KT, §5], which,
in its turn, generalizes [KM, §4]. In a sense, this modification allows one to use the full
strength of the construction originally introduced by Margulis [Mr1], obtaining what may
be considered as the optimal result (see the discussion after the proof of Theorem 2.2).

Proof. We proceed by induction on $k$. If $k = 0$, the poset $\mathfrak{P}$ is empty, and for any $z \in B$
one can take $\mathfrak{F}_z = \varnothing$ and check that (M1) and (M2) are satisfied for all positive $\varepsilon$; thus all
points of $B$ are marked. Now take $k \ge 1$ and suppose that the claim is proved for all smaller
values of $k$.

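Fix $C, \alpha, D, \mu$, $B = B(x, r)$ and $\psi$ as in the formulation of the theorem. For any
$y \in B \cap \operatorname{supp}\mu$ define

$H(y) := \{s \in \mathfrak{P} \mid |\psi_s(y)| < \eta(s)\}$;

this is a finite subset of $\mathfrak{P}$ in view of (A3). If $H(y)$ is empty, $y$ is clearly $\varepsilon$-marked for any
positive $\varepsilon$: indeed, since $|\psi_s(y)| \ge \eta(s)$ for all $s \in \mathfrak{P}$, one can again take $\mathfrak{F}_y$ to be the empty
set and check that (M1) and (M2) are satisfied. Thus one only needs to consider points $y$
from the set

$E := \{y \in B \cap \operatorname{supp}\mu \mid H(y) \ne \varnothing\} = \{y \in B \cap \operatorname{supp}\mu \mid \exists\, s \in \mathfrak{P} \text{ with } |\psi_s(y)| < \eta(s)\}$.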
Take $y \in E$ and $s \in H(y)$, and define

(2.2)  $r_{s,y} := \sup\{t > 0 \mid \|\psi_s\|_{\mu,B(y,t)} < \eta(s)\}$.

It follows from the continuity of functions $\psi_s$ that for small enough positive $t$ one has
$\|\psi_s\|_{\mu,B(y,t)} < \eta(s)$, hence $r_{s,y} > 0$. Denote $B(y, r_{s,y})$ by $B_{s,y}$. From (A2) it is clear that
$B_{s,y}$ does not contain $B$; therefore one has $r_{s,y} < 2r$. Note also that (2.2) immediately
implies that

(2.3)  $\|\psi_s\|_{\mu,B_{s,y}} \le \eta(s)$.

Now for any $y \in E$ choose an element $s_y$ of $H(y)$ such that $r_{s_y,y} \ge r_{s,y}$ for all $s \in H(y)$
(this can be done since $H(y)$ is finite). For brevity let us denote $r_{s_y,y}$ by $r_y$ and
$B_{s_y,y} = \bigcup_{s \in H(y)} B_{s,y}$ by $B_y$. Also let us denote the poset $\mathfrak{P}(\{s_y\})$ by $\mathfrak{P}_y$.

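The next claim allows one to show a point $z \in B_y$ to be marked relative to $\mathfrak{P}$ once it is
marked relative to $\mathfrak{P}_y$. Namely, fix $\varepsilon > 0$ and $y \in E$, and take $z$ such that

(2.4)  $z \in B_y \cap \operatorname{supp}\mu \cap \Phi(\varepsilon, \mathfrak{P}_y)$ and $|\psi_{s_y}(z)| \ge \varepsilon\eta(s_y)$.

Then we claim that $z$ belongs to $\Phi(\varepsilon, \mathfrak{P})$; equivalently,

(2.5)  $(B_y \cap \operatorname{supp}\mu) \smallsetminus \Phi(\varepsilon, \mathfrak{P}) \subset (B_y \smallsetminus \Phi(\varepsilon, \mathfrak{P}_y)) \cup \{x \in B_y \mid |\psi_{s_y}(x)| < \varepsilon\eta(s_y)\}$.

Indeed, take $z$ as in (2.4); by definition of $\Phi(\varepsilon, \mathfrak{P}_y)$, there exists a flag $\mathfrak{F}_{y,z} \subset \mathfrak{P}_y$ such
that

(2.6)  $\varepsilon\eta(s) \le |\psi_s(z)| \le \eta(s) \quad \forall\, s \in \mathfrak{F}_{y,z}$

and

(2.7)  $|\psi_s(z)| \ge \eta(s) \quad \forall\, s \in \mathfrak{P}_y(\mathfrak{F}_{y,z})$.

Put $\mathfrak{F}_z := \mathfrak{F}_{y,z} \cup \{s_y\}$. Then $\mathfrak{P}(\mathfrak{F}_z) = \mathfrak{P}_y(\mathfrak{F}_{y,z})$, so (M2) immediately follows from (2.7).
Property (M1) for $s \ne s_y$ is given by (2.6), and for $s = s_y$ by (2.4) and (2.3). Thus (2.5) is
proved.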

Note that for any $y \in E$ one clearly has $r_y < 2r$, which in particular implies that
$B_y \subset B(x, 3r)$. We are going to fix some $r'_y$ strictly between $r_y$ and $\min(2r, 3r_y)$, and
denote $B(y, r'_y)$ by $B'_y$. Clearly one has

(2.8)  $\|\psi_s\|_{\mu,B'_y} \ge \eta(s)$ for any $y \in E$ and $s \in \mathfrak{P}$.

(Indeed, the definition of $r_y$ and (2.2) imply the above inequality for any $s \in H(y)$, and it
obviously holds if $s \notin H(y)$.)

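Now observe that $\mathfrak{P}_y$, $B'_y$ and $\tilde B'_y := B(y, 3^{k-1} r'_y)$ satisfy properties

• (A0) with $k$ replaced by $k - 1$, in view of (2.1);

• (A2), in view of (2.8);

• (A1) and (A3), since

$\tilde B'_y = B(y, 3^{k-1} r'_y) \subset B(x, 3^{k-1} r'_y + r) \subset B(x, (2 \cdot 3^{k-1} + 1) r) \subset B(x, 3^k r) = \tilde B$.

Therefore one has

(2.9)  $\mu(B_y \smallsetminus \Phi(\varepsilon, \mathfrak{P}_y)) \le \mu(B'_y \smallsetminus \Phi(\varepsilon, \mathfrak{P}_y)) \le (k-1)C(ND^2)^{k-1} \varepsilon^\alpha \mu(B'_y)$
       $\le D(k-1)C(ND^2)^{k-1} \varepsilon^\alpha \mu(B_y)$

by the induction assumption and the Federer property of $\mu$. On the other hand, in view of
$\psi_{s_y}$ being $(C,\alpha)$-good on $\tilde B \supset B'_y$ with respect to $\mu$, one can write

(2.10)  $\mu(\{x \in B_y \mid |\psi_{s_y}(x)| < \varepsilon\eta(s_y)\}) \le \mu(\{x \in B'_y \mid |\psi_{s_y}(x)| < \varepsilon\eta(s_y)\})$
        $\le C \left(\frac{\varepsilon\eta(s_y)}{\|\psi_{s_y}\|_{\mu,B'_y}}\right)^{\alpha} \mu(B'_y) \le C\varepsilon^\alpha \mu(B'_y) \le CD\varepsilon^\alpha \mu(B_y)$,

where the third inequality follows from (2.8) and the last one from the Federer property.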

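Now recall that we need to estimate the measure of $E \smallsetminus \Phi(\varepsilon, \mathfrak{P})$. For any $y \in E$, in view of
(2.5), (2.9) and (2.10) one has

(2.11)  $\mu(B_y \smallsetminus \Phi(\varepsilon, \mathfrak{P})) \le C\left((k-1)N^{k-1}D^{2k-1} + D\right) \varepsilon^\alpha \mu(B_y)$
        $\le kCN^{k-1}D^{2k-1} \varepsilon^\alpha \mu(B_y)$.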
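
For the reader's convenience, the arithmetic behind (2.11) can be spelled out as follows. This is a sketch of the bookkeeping only: it combines the bounds (2.9) and (2.10) via the inclusion (2.5), and uses $N \ge 1$ and $D \ge 1$, the latter being automatic because the ratio in (1.1) is at least 1.

```latex
\begin{align*}
\mu\bigl(B_y \smallsetminus \Phi(\varepsilon,\mathfrak{P})\bigr)
  &\le D(k-1)C(ND^2)^{k-1}\varepsilon^{\alpha}\mu(B_y) + CD\,\varepsilon^{\alpha}\mu(B_y) \\
  &= C\bigl((k-1)N^{k-1}D^{2k-1} + D\bigr)\varepsilon^{\alpha}\mu(B_y) \\
  &\le C\bigl((k-1)N^{k-1}D^{2k-1} + N^{k-1}D^{2k-1}\bigr)\varepsilon^{\alpha}\mu(B_y)
   = kCN^{k-1}D^{2k-1}\varepsilon^{\alpha}\mu(B_y).
\end{align*}
```

The last inequality uses $D \le N^{k-1}D^{2k-1}$, which holds since $k \ge 1$.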

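Finally, consider the covering $\{B_y \mid y \in E\}$ of $E$, choose a countable subset $Y$ of $E$ such
that the multiplicity of the subcovering $\{B_y \mid y \in Y\}$ is at most $N$, and write

(2.12)  $\sum_{y \in Y} \mu(B_y) \le N\mu\Bigl(\bigcup_{y \in Y} B_y\Bigr) \le N\mu(B(x, 3r)) \le ND\mu(B)$.

Therefore the measure of $E \smallsetminus \Phi(\varepsilon, \mathfrak{P})$ is bounded from above by

$\sum_{y \in Y} \mu(B_y \smallsetminus \Phi(\varepsilon, \mathfrak{P})) \le kCN^{k-1}D^{2k-1} \varepsilon^\alpha \sum_{y \in Y} \mu(B_y) \le kC(ND^2)^k \varepsilon^\alpha \mu(B)$,

where the first inequality follows from (2.11) and the second from (2.12), and the theorem is
proven. □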

We now apply Theorem 2.1 to the (appropriately weighted) poset

$\mathfrak{P}_k := \{\text{nonzero primitive subgroups of } \mathbb{Z}^k\}$,

with the inclusion relation (recall that a discrete subgroup $\Gamma \subset \mathbb{Z}^k$ is called primitive if
$\Gamma = \Gamma_{\mathbb{R}} \cap \mathbb{Z}^k$), and prove a general version of Theorem 0.2.

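Theorem 2.2. Let $k, N \in \mathbb{N}$ and $C, D, \alpha, \rho > 0$, and suppose we are given an $N$-Besicovitch
metric space $X$, a ball $B = B(x_0, r_0) \subset X$, a measure $\mu$ which is $D$-Federer on
$\tilde B := B(x_0, 3^k r_0)$, and a map $h : \tilde B \to \operatorname{GL}_k(\mathbb{R})$. Assume that the following two conditions hold:

[2.2-i]  $\forall\, \Gamma \in \mathfrak{P}_k$, the function $x \mapsto \|h(x)\Gamma\|$ is $(C,\alpha)$-good on $\tilde B$ w.r.t. $\mu$;

[2.2-ii]  $\forall\, \Gamma \in \mathfrak{P}_k$, $\|h(\cdot)\Gamma\|_{\mu,B} \ge \rho^{\operatorname{rk}(\Gamma)}$.

Then for any positive $\varepsilon \le \rho$ one has

$\mu(\{x \in B \mid h(x)\mathbb{Z}^k \notin K_\varepsilon\}) \le kC(ND^2)^k \left(\frac{\varepsilon}{\rho}\right)^{\alpha} \mu(B)$.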

Proof. We let $\mathfrak{P} = \mathfrak{P}_k$ and for all $\Gamma \in \mathfrak{P}$ define $\psi_\Gamma(\cdot) := \|h(\cdot)\Gamma\|$ and $\eta(\Gamma) := \rho^{\operatorname{rk}(\Gamma)}$. It is
easy to verify that $(\mathfrak{P}, \eta)$ and $\psi$ satisfy properties (A0)-(A3) of Theorem 2.1. Indeed, (A0)
holds since any two distinct primitive subgroups are either incomparable or have different ranks,
(A1) is given by [2.2-i] and (A2) by [2.2-ii]. To check (A3) it suffices to observe that for any
$x$, since $\bigwedge(h(x)\mathbb{Z}^k)$ is discrete in $\bigwedge(\mathbb{R}^k)$, the number of primitive subgroups $\Gamma$ of $\mathbb{Z}^k$ for which
$\|h(x)\Gamma\| < 1$ is finite.

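In view of Theorem 2.1, it remains to prove that a point $x \in B$ with $h(x)\mathbb{Z}^k \notin K_\varepsilon$ cannot
be $(\varepsilon/\rho)$-marked; in other words,

(2.13)  $\Phi(\varepsilon/\rho, \mathfrak{P}) \subset \{x \in B \mid \|h(x)\mathbf{v}\| \ge \varepsilon \text{ for all } \mathbf{v} \in \mathbb{Z}^k \smallsetminus \{0\}\}$.

Take an $(\varepsilon/\rho)$-marked point $x \in B$, and let $\{0\} = \Gamma_0 \subsetneq \Gamma_1 \subsetneq \dots \subsetneq \Gamma_j = \mathbb{Z}^k$ be all the elements
of $\mathfrak{F}_x \cup \{\{0\}, \mathbb{Z}^k\}$. Properties (M1) and (M2) translate into:

(2.14)  $\frac{\varepsilon}{\rho} \cdot \rho^{\operatorname{rk}(\Gamma_i)} \le \|h(x)\Gamma_i\| \le \rho^{\operatorname{rk}(\Gamma_i)} \quad \forall\, i = 0, \dots, j-1$,

and

(2.15)  $\|h(x)\Gamma\| \ge \rho^{\operatorname{rk}(\Gamma)} \quad \forall\, \Gamma \in \mathfrak{P}(\mathfrak{F}_x)$.

(Even though $\Gamma_0 = \{0\} \notin \mathfrak{P}$, it is clear that it also satisfies (2.14).)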

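Take any $\mathbf{v} \in \mathbb{Z}^k \smallsetminus \{0\}$. Then there exists $i$, $1 \le i \le j$, such that $\mathbf{v} \in \Gamma_i \smallsetminus \Gamma_{i-1}$. Denote
$(\Gamma_{i-1} + \mathbb{Z}\mathbf{v})_{\mathbb{R}} \cap \mathbb{Z}^k$ by $\Gamma$. Clearly it is a primitive subgroup of $\mathbb{Z}^k$ satisfying $\Gamma_{i-1} \subset \Gamma \subset \Gamma_i$,
therefore $\Gamma \in \mathfrak{F}_x \cup \mathfrak{P}(\mathfrak{F}_x)$. Now one can use properties (2.14) and (2.15) to deduce that

(2.16)  $\|h(x)\Gamma\| \ge \min\left(\frac{\varepsilon}{\rho} \cdot \rho^{\operatorname{rk}(\Gamma)},\ \rho^{\operatorname{rk}(\Gamma)}\right) = \varepsilon\rho^{\operatorname{rk}(\Gamma) - 1} = \varepsilon\rho^{\operatorname{rk}(\Gamma_{i-1})}$.

On the other hand, from the submultiplicativity of the covolume it follows that
$\|h(x)\Gamma\| \le \|h(x)\Gamma_{i-1}\| \cdot \|h(x)\mathbf{v}\|$. Thus, by (2.14) and (2.16),

$\|h(x)\mathbf{v}\| \ge \frac{\|h(x)\Gamma\|}{\|h(x)\Gamma_{i-1}\|} \ge \frac{\varepsilon\rho^{\operatorname{rk}(\Gamma_{i-1})}}{\rho^{\operatorname{rk}(\Gamma_{i-1})}} = \varepsilon$.

This shows (2.13) and completes the proof of the theorem. □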

In order to better understand the difference between this theorem and its predecessors
(proved in [KM, KLW, KT]), let us draw a corollary from it. Namely, suppose that $X$, $\mu$ and
$B$ are as in the above theorem, that $h$ satisfies [2.2-i] with some $C, \alpha$ (for example, $X = \mathbb{R}^d$,
$\mu = \lambda$ and $h$ is a polynomial map), and that for some small positive $\varepsilon$, the relative measure
of $x \in B$ for which $h(x)\mathbb{Z}^k \notin K_\varepsilon$ is at least 1/2. Then Theorem 2.2 asserts that there
exists a subgroup $\Gamma$ of $\mathbb{Z}^k$ such that $\|h(\cdot)\Gamma\|_{\mu,B} < \rho^{\operatorname{rk}(\Gamma)}$, where $\rho = \left(2kC(ND^2)^k\right)^{1/\alpha} \varepsilon$.
Consequently, in view of Minkowski's Lemma, the whole 'trajectory' $h(B \cap \operatorname{supp}\mu)\mathbb{Z}^k$ must
be contained in the complement to $K_{\mathrm{const} \cdot \varepsilon}$, with the constant depending only on $D, N, C, \alpha$
and $k$. In other words, it must stay at a (uniformly) bounded distance from the complement
to $K_\varepsilon$. Note that using previously known results it was only possible to conclude that
$h(B \cap \operatorname{supp}\mu)\mathbb{Z}^k$ must be outside of $K_{\mathrm{const} \cdot \varepsilon^{1/(k-1)}}$, a compact set of diameter approximately
$(k - 1)$ times smaller than that of $K_\varepsilon$.

When it comes to number-theoretic applications, the crucial advantage is that replacing
$\rho$ by $\rho^{\operatorname{rk}(\Gamma)}$ makes [2.2-ii] easier to check. This will be demonstrated in the next section,
where Theorem 2.2 will be applied to $h$ as in (0.4).
3. An application to metric Diophantine approximation



We recall some notation and terminology introduced in the beginning of this paper, as
well as in the paper [K2]. For $m, n \in \mathbb{N}$ and $v > 0$, we denote by $\mathcal{W}_v$ the set of $A \in M_{m,n}$
for which there are infinitely many $\mathbf{q} \in \mathbb{Z}^n$ such that (0.5) holds for some $\mathbf{p} \in \mathbb{Z}^m$. The
dimensionality of the matrices will be clear from the context. The Diophantine exponent
$\omega(A)$ of $A$ defined in the Introduction is equal to

$\omega(A) = \sup\{v \mid A \in \mathcal{W}_v\}$.

Clearly $\mathcal{W}_u \subset \mathcal{W}_v$ if $u \ge v$. We will also use the notation

$\mathcal{W}_v^+ := \bigcup_{u > v} \mathcal{W}_u = \{A \mid \omega(A) > v\}$.

Note that the definition of $\omega(A)$, unlike that of the sets $\mathcal{W}_v$, does not depend on the choice
of norms on $\mathbb{R}^m$ and $\mathbb{R}^n$.

Although there are many interesting and unsolved Diophantine problems related to the
space of $m \times n$ matrices, we specialize to the case $m = 1$, that is, consider Diophantine
properties of vectors (= row matrices) $\mathbf{y} \in \mathbb{R}^n$. With some abuse of notation, we will view
integer vectors $\mathbf{q} \in \mathbb{Z}^n$ as column vectors, so that $\mathbf{y}\mathbf{q}$ stands for $y_1 q_1 + \dots + y_n q_n$.

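Now let us describe a correspondence, dating back to [Sc1] and [Da3], between approximation
properties of vectors $\mathbf{y} \in \mathbb{R}^n$ and dynamics of certain trajectories in $\Omega_{n+1}$. Given a
row vector $\mathbf{y} \in \mathbb{R}^n$ one defines

(3.1)  $u_{\mathbf{y}} := \begin{pmatrix} 1 & \mathbf{y} \\ 0 & I_n \end{pmatrix}$,

and considers the lattice $u_{\mathbf{y}}\mathbb{Z}^{n+1}$, that is, the collection of vectors of the form
$\begin{pmatrix} \mathbf{y}\mathbf{q} + p \\ \mathbf{q} \end{pmatrix}$, where $p \in \mathbb{Z}$ and $\mathbf{q} \in \mathbb{Z}^n$. Then one can read Diophantine properties of $\mathbf{y}$ from
the behavior of the trajectory $F u_{\mathbf{y}}\mathbb{Z}^{n+1}$ in the space of lattices, where

(3.2)  $F := \{g_t \mid t \ge 0\}$, with $g_t = \operatorname{diag}(e^{t}, e^{-t/n}, \dots, e^{-t/n})$,

is a one-parameter subsemigroup of $\operatorname{SL}_{n+1}(\mathbb{R})$ which expands the first coordinate and
uniformly contracts the last $n$ coordinates of vectors in $\mathbb{R}^{n+1}$.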

The following elementary lemma was proved in [K2]. 

Lemma 3.1. Suppose we are given a set $E \subset \mathbb{R}^2$ which is discrete and homogeneous with
respect to positive integers, that is, $kE \subset E$ for any $k \in \mathbb{N}$. Also take $a, b > 0$, $v > a/b$, and
define $c$ by

$c = \frac{bv - a}{v + 1} \iff v = \frac{a + c}{b - c}$.

Then the following are equivalent:

[3.1-i]  there exist $(x, z) \in E$ with arbitrarily large $|z|$ such that $|x| < |z|^{-v}$;

[3.1-ii]  there exist arbitrarily large $t > 0$ such that for some $(x, z) \in E \smallsetminus \{0\}$ one has

$\max\left(e^{at}|x|,\ e^{-bt}|z|\right) < e^{-ct}$.
Taking $v > n$, $\mathbf{y} \in \mathbb{R}^n$ and

$E = \{(\mathbf{y}\mathbf{q} + p, \|\mathbf{q}\|) \mid (p, \mathbf{q}) \in \mathbb{Z}^{n+1}\}$,

one notices that [3.1-i] is equivalent to $\mathbf{y} \in \mathcal{W}_v$. On the other hand, choosing $a = 1$ and
$b = 1/n$ one sees that [3.1-ii] amounts to

(3.3)  $g_t u_{\mathbf{y}}\mathbb{Z}^{n+1} \notin K_{e^{-ct}}$ for an unbounded set of $t \in \mathbb{R}_+$,

where

(3.4)  $c = \frac{v - n}{n(v + 1)} \iff v = \frac{n(1 + c)}{1 - nc}$.
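
The change of variables (3.4) is easy to get wrong by hand, so here is a small Python sketch of the two conversions in exact rational arithmetic. The function names are hypothetical; the formulas are those of (3.4), which is the case $a = 1$, $b = 1/n$ of Lemma 3.1.

```python
from fractions import Fraction

def c_from_v(v, n):
    """c = (v - n) / (n (v + 1)), the first formula in (3.4)."""
    v = Fraction(v)
    return (v - n) / (n * (v + 1))

def v_from_c(c, n):
    """v = n (1 + c) / (1 - n c), the second formula in (3.4); requires c < 1/n."""
    c = Fraction(c)
    if n * c >= 1:
        raise ValueError("need c < 1/n")
    return n * (1 + c) / (1 - n * c)
```

For example, with $n = 2$ the exponent $v = 5$ corresponds to $c = 1/4$, while $v = n$ corresponds to $c = 0$ and $c$ increases to $1/n$ as $v \to \infty$.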

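Thus, if we define $\gamma(\mathbf{y})$ to be the supremum of all $c$ for which (3.3) holds, or, equivalently,

$\gamma(\mathbf{y}) := \sup\{c \mid g_t u_{\mathbf{y}}\mathbb{Z}^{n+1} \notin K_{e^{-ct}} \text{ for infinitely many } t \in \mathbb{N}\}$

(in [K2] this quantity was called the growth exponent of $u_{\mathbf{y}}\mathbb{Z}^{n+1}$ with respect to $F$), then
we have the equality

(3.5)  $\omega(\mathbf{y}) = n\,\frac{1 + \gamma(\mathbf{y})}{1 - n\gamma(\mathbf{y})}$

(cf. [K2, Corollary 2.3]).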

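Now let us turn to computing Diophantine exponents of measures. If $\nu$ is a measure
on $\mathbb{R}^n$ and $v \ge n$, one has $\omega(\nu) \le v$ iff $\nu(\mathcal{W}_u) = 0$ for any $u > v$. In view of the above
discussion, this amounts to saying that for any $d > c$ where $c$ is given by (3.4),

$\nu(\{\mathbf{y} \mid g_t u_{\mathbf{y}}\mathbb{Z}^{n+1} \notin K_{e^{-dt}} \text{ for an unbounded set of } t \in \mathbb{R}_+\}) = 0$.

This is easily seen to be equivalent to

$\forall\, d > c, \quad \nu(\{\mathbf{y} \mid g_t u_{\mathbf{y}}\mathbb{Z}^{n+1} \notin K_{e^{-dt}} \text{ for infinitely many } t \in \mathbb{N}\}) = 0$,

and, in view of the Borel-Cantelli Lemma, a condition sufficient for the latter is

(3.6)  $\sum_{t=1}^{\infty} \nu(\{\mathbf{y} \mid g_t u_{\mathbf{y}}\mathbb{Z}^{n+1} \notin K_{e^{-dt}}\}) < \infty \quad \forall\, d > c$.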

This is precisely where the measure estimates discussed in the previous section come in. 

Proposition 3.2. Let $X$ be a Besicovitch metric space, $B = B(x, r) \subset X$ a ball, $\mu$ a 
measure which is $D$-Federer on $\tilde B \overset{\mathrm{def}}{=} B(x, 3^{n+1} r)$ for some $D > 0$, and $\mathbf f$ a continuous map 
from $\tilde B$ to $\mathbb R^n$. Also take $c \ge 0$ and assume that: 

[3.2-i] $\exists\, C, \alpha > 0$ such that all the functions $x \mapsto \|g_t u_{\mathbf f(x)}\Gamma\|$, $\Gamma \in \mathcal S_{n+1}$, are $(C, \alpha)$-good on 
$\tilde B$ w.r.t. $\mu$; 

[3.2-ii] for any $d > c$ there exists $T = T(d) > 0$ such that for any $t \ge T$ and any $\Gamma \in \mathcal S_{n+1}$ 
one has 

\[ \|g_t u_{\mathbf f(\cdot)}\Gamma\|_{\mu, B} \ge e^{-\operatorname{rk}(\Gamma)\, d t}. \]

Then $\omega\bigl(\mathbf f_*(\mu|_B)\bigr) \le v$, where $v$ is related to $c$ via (3.4). 

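Proof. As was observed in the course of the preceding discussion, it suffices to show (3.6) 
for $\nu = \mathbf f_*(\mu|_B)$. We now proceed to verify that the map $h = g_t u_{\mathbf f}$ satisfies the assumptions 
of Theorem 2.2, with $k = n + 1$. Condition [2.2-i] clearly coincides with [3.2-i]. For the other 
condition, we take $d > c$ and $\rho = e^{-\frac{c+d}{2}t}$, so that [3.2-ii] implies [2.2-ii] for any $t \ge T\bigl(\frac{c+d}{2}\bigr)$. 
Therefore, by Theorem 2.2, 

\[ \begin{aligned} \nu\bigl(\{ \mathbf y \mid g_t u_{\mathbf y}\mathbb Z^{n+1} \notin K_{e^{-dt}} \}\bigr) &= \mu\bigl(\{ x \in B \mid h(x)\mathbb Z^{n+1} \notin K_{e^{-dt}} \}\bigr) \\ &\le (n + 1)\, C\, (N_X D^2)^{n+1} \bigl(e^{-dt} e^{\frac{c+d}{2}t}\bigr)^{\alpha} \mu(B) \\ &= \mathrm{const} \cdot e^{-\alpha\frac{d-c}{2}t} \end{aligned} \]

for all but finitely many $t \in \mathbb N$. This readily implies (3.6). □ 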
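The last step of the proof is a geometric series: a bound of the form $\mathrm{const} \cdot e^{-\kappa t}$ with $\kappa > 0$ is summable over $t \in \mathbb N$. A minimal numerical sketch follows; the names `geometric_tail_sum` and `partial_sum` are illustrative, and `kappa` stands for the rate in the exponent, assumed positive.

```python
import math


def geometric_tail_sum(kappa):
    """Closed form of sum_{t >= 1} exp(-kappa * t) for kappa > 0."""
    if kappa <= 0:
        raise ValueError("kappa must be positive")
    r = math.exp(-kappa)
    return r / (1.0 - r)


def partial_sum(kappa, terms):
    """Partial sum of exp(-kappa * t) for t = 1, ..., terms."""
    return sum(math.exp(-kappa * t) for t in range(1, terms + 1))
```

The closed form blows up as `kappa` tends to 0, which reflects why the argument needs $d$ strictly greater than $c$.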

Remark 3.3. Note that both [3.2-i] and [3.2-ii] trivially hold for $\Gamma$ of rank $0$ and $n+1$, and 
also that the validity of those conditions for all primitive $\Gamma$ is equivalent to their validity 
for all subgroups. It will be convenient to denote by $\mathcal S_{k,j}$ the set of all subgroups of $\mathbb Z^k$ of 
rank $j$. Thus, to apply Proposition 3.2 it will be enough to check [3.2-i,ii] for all $\Gamma \in \mathcal S_{n+1,j}$, 
$j = 1, \dots, n$. 

4. Computing Diophantine exponents of measures 

In this section we use Proposition 3.2 to prove Theorem 1.3, that is, for a given $v \ge n$, 
write necessary and sufficient conditions for Diophantine exponents of certain measures to 
be not greater than $v$. For this we need to understand to what extent the two conditions 
in the above proposition are necessary. While not much can be said about the first one, it 
turns out that assumption [3.2-ii] is in fact necessary for the conclusion of Proposition 3.2. 
Furthermore, the consequences of [3.2-ii] not being true are surprisingly strong. 

Lemma 4.1. Let $\mu$ be a measure on a set $B$, take $c, v \ge 0$ related via (3.4), and let $\mathbf f$ be 
a map from $B$ to $\mathbb R^n$ such that [3.2-ii] does not hold. Then $\mathbf f(B \cap \operatorname{supp}\mu) \subset \mathcal W_u$ for some 
$u > v$. 

We remark that the negation of the conclusion of Proposition 3.2 is much weaker: it 
simply amounts to saying that for some $u > v$ the set $\{x \in B \mid \mathbf f(x) \in \mathcal W_u\}$ has positive 
measure. 

Proof. The assumption of the lemma says that there exists $d > c$ such that one has 

\[ \sup_{x \in B \cap \operatorname{supp}\mu} \|g_t u_{\mathbf f(x)}\Gamma\| < e^{-\operatorname{rk}(\Gamma)\, d t} \]

for arbitrarily large $t$ (and $\Gamma \subset \mathbb Z^{n+1}$ dependent on $t$). In other words, there exists $1 \le j \le n$, 
a sequence $t_i \to \infty$ and a sequence of subgroups $\Gamma_i \in \mathcal S_{n+1,j}$ such that for any $x \in B \cap \operatorname{supp}\mu$ 
one has $\|g_{t_i} u_{\mathbf f(x)}\Gamma_i\| < e^{-j d t_i}$. But in view of Minkowski's Lemma, this means that for any $i$ 
and any $x \in B \cap \operatorname{supp}\mu$ there is a nonzero vector $\mathbf v \in g_{t_i} u_{\mathbf f(x)}\Gamma_i$ with $\|\mathbf v\| < 2^j e^{-d t_i}$. Hence 

\[ g_{t_i} u_{\mathbf f(x)}\mathbb Z^{n+1} \notin K_{2^j e^{-d t_i}}, \]

which implies that $\gamma(\mathbf f(x)) \ge d$, finishing the proof in view of (3.5). □ 

It is worthwhile to point out that it is precisely the above argument that requires a 
strengthening of the quantitative nondivergence obtained in §2. Previously available 
techniques could only yield $\gamma(\mathbf f(x)) \ge d/n$, which was enough in the case $v = n \iff c = 0$, but 
not in the general case. 
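To write [3.2-ii] in a more convenient form, instead of discrete subgroups $\Gamma$ of $\mathbb R^k$ we will 
work with elements $\mathbf w \in \bigwedge(\mathbb R^k)$ representing them, saying that $\mathbf w$ represents $\Gamma \ne \{0\}$ if 
$\mathbf w = \mathbf v_1 \wedge \dots \wedge \mathbf v_j$ where $\mathbf v_1, \dots, \mathbf v_j$ is a basis of $\Gamma$ as a $\mathbb Z$-module. Clearly $\mathbf w$ representing $\Gamma$ 
is defined up to a sign, hence $\|\Gamma\| = \|\mathbf w\|$. Now let us reproduce a computation (first done 
in [KM]) of coordinates of $g_t u_{\mathbf f(x)}\mathbf w$ with respect to the standard basis of $\bigwedge(\mathbb R^{n+1})$. For the 
rest of this section let us denote $\mathbb R^{n+1}$ by $V$, its standard basis by $\mathbf e_0, \mathbf e_1, \dots, \mathbf e_n$, and the 
space spanned by $\mathbf e_1, \dots, \mathbf e_n$ by $V_0$. For $I = \{i_1, \dots, i_j\} \subset \{0, \dots, n\}$, $i_1 < i_2 < \dots < i_j$, let 

\[ \mathbf e_I \overset{\mathrm{def}}{=} \mathbf e_{i_1} \wedge \dots \wedge \mathbf e_{i_j} \in \textstyle\bigwedge^j(V), \]

with the convention $\mathbf e_\varnothing = 1$; then $\{\mathbf e_I \mid \#I = j\}$ is a basis of $\bigwedge^j(V)$, and we extend the 
Euclidean structure $\langle\cdot,\cdot\rangle$ from $V$ to its exterior powers so that this basis becomes orthonormal. 
Since the action of $u_{\mathbf y}$ leaves $\mathbf e_0$ invariant and sends $\mathbf e_i$, $i > 0$, to $\mathbf e_i + y_i \mathbf e_0$, one can write¹ 

\[ u_{\mathbf y}\mathbf e_I = \begin{cases} \mathbf e_I & \text{if } 0 \in I, \\ \mathbf e_I + \sum_{i \in I} \langle \mathbf e_i \wedge \mathbf e_{I \smallsetminus \{i\}}, \mathbf e_I \rangle\, y_i\, \mathbf e_{I \cup \{0\} \smallsetminus \{i\}} & \text{otherwise.} \end{cases} \tag{4.1} \]

Therefore $\mathbf w \in \bigwedge^j(V)$ is sent to 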



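\[ u_{\mathbf y}\mathbf w = \sum_{I \subset \{1, \dots, n\}} \langle \mathbf e_I, \mathbf w \rangle\, \mathbf e_I + \sum_{J \subset \{1, \dots, n\}} \Bigl( \sum_{i=0}^{n} \langle \mathbf e_i \wedge \mathbf e_J, \mathbf w \rangle\, y_i \Bigr)\, \mathbf e_0 \wedge \mathbf e_J \tag{4.2} \]

(where it is understood that $\#I = j$, $\#J = j - 1$, and $y_0 = 1$). Note that the first sum 
in (4.2) is the image of $\mathbf w$ under the orthogonal projection from $\bigwedge^j(V)$ onto $\bigwedge^j(V_0)$, which 
(the projection) we will denote by $\pi$. On the other hand, each term in the second sum 
is orthogonal to $\bigwedge(V_0)$. To simplify (4.2), let us define a linear map $\mathbf c$ from $\bigwedge^j(V)$ to 
$\bigl(\bigwedge^{j-1}(V_0)\bigr)^{n+1}$ by setting the $i$th component of $\mathbf c(\mathbf w)$, $i = 0, 1, \dots, n$, equal to 

\[ \mathbf c(\mathbf w)_i \overset{\mathrm{def}}{=} \sum_{J \subset \{1, \dots, n\}} \langle \mathbf e_i \wedge \mathbf e_J, \mathbf w \rangle\, \mathbf e_J. \tag{4.3} \]

For example, the choice $i = 0$ gives $\mathbf c(\mathbf w)_0 = \sum_{J \subset \{1, \dots, n\}} \langle \mathbf e_0 \wedge \mathbf e_J, \mathbf w \rangle\, \mathbf e_J$, so that 

\[ \mathbf e_0 \wedge \mathbf c(\mathbf w)_0 = \mathbf w - \pi(\mathbf w). \tag{4.4} \]

In particular, the kernel of $\mathbf w \mapsto \mathbf c(\mathbf w)_0$ is $\bigwedge^j(V_0)$, that is, the orthogonal complement to 
$\mathbf e_0 \wedge \bigwedge^{j-1}(V_0)$. Likewise, for every $i$ the kernel of $\mathbf w \mapsto \mathbf c(\mathbf w)_i$ is the orthogonal complement 
to $\mathbf e_i \wedge \bigwedge^{j-1}(V_0)$, which implies that the kernel of $\mathbf c$ is trivial. 
With this notation, (4.2) can be rewritten as 

\[ u_{\mathbf y}\mathbf w = \pi(\mathbf w) + \mathbf e_0 \wedge \tilde{\mathbf y}\,\mathbf c(\mathbf w), \]

where $\tilde{\mathbf y}$ stands for the row vector $(1, y_1, \dots, y_n)$, and the product in $\tilde{\mathbf y}\,\mathbf c(\mathbf w)$ is the formal 
matrix multiplication (of the row vector $\tilde{\mathbf y} \in \mathbb R^{n+1}$ and the column vector $\mathbf c(\mathbf w) \in \bigl(\bigwedge^{j-1}(V_0)\bigr)^{n+1}$). 
In other words, 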
In other words, 

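\[ u_{\mathbf y}\mathbf w = \pi(\mathbf w) + \mathbf e_0 \wedge \sum_{i=0}^{n} y_i\, \mathbf c(\mathbf w)_i \underset{(4.4)}{=} \mathbf w + \mathbf e_0 \wedge \sum_{i=1}^{n} y_i\, \mathbf c(\mathbf w)_i. \]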
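Formula (4.2) can be checked on a concrete decomposable element by computing coordinates of wedge products as minors. The sketch below is illustrative only: the helper names (`det`, `coords`, `apply_u`, `pairing`, `u_via_formula`) are not from the paper, a $j$-vector is stored as a dictionary from sorted index tuples $I$ to the coordinate $\langle \mathbf e_I, \mathbf w\rangle$, and integer inputs are assumed.

```python
from itertools import combinations


def det(m):
    """Determinant by Laplace expansion along the first row (tiny matrices only)."""
    if not m:
        return 1
    return sum((-1) ** c * m[0][c] * det([row[:c] + row[c + 1:] for row in m[1:]])
               for c in range(len(m)))


def coords(vectors, dim):
    """Coordinates <e_I, v_1 ^ ... ^ v_j>, indexed by sorted tuples I."""
    j = len(vectors)
    return {I: det([[v[i] for v in vectors] for i in I])
            for I in combinations(range(dim), j)}


def apply_u(y, v):
    """u_y v for y = (y_1, ..., y_n): only the 0th coordinate changes."""
    return [v[0] + sum(yi * vi for yi, vi in zip(y, v[1:]))] + list(v[1:])


def pairing(i, J, w):
    """<e_i ^ e_J, w> for a sorted tuple J; the sign comes from sorting e_i into J."""
    if i in J:
        return 0
    sign = (-1) ** sum(1 for k in J if k < i)
    return sign * w[tuple(sorted(J + (i,)))]


def u_via_formula(y, w, n, j):
    """Right-hand side of (4.2), with y_0 = 1."""
    y_tilde = [1] + list(y)
    out = {}
    for I in combinations(range(1, n + 1), j):
        out[I] = w[I]  # the projection pi(w): index sets avoiding 0
    for J in combinations(range(1, n + 1), j - 1):
        out[(0,) + J] = sum(y_tilde[i] * pairing(i, J, w) for i in range(n + 1))
    return out
```

Applying $u_{\mathbf y}$ to each factor of $\mathbf v_1 \wedge \mathbf v_2$ and then taking coordinates should give the same dictionary as `u_via_formula` applied to the coordinates of $\mathbf v_1 \wedge \mathbf v_2$.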



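¹The quantity $\langle \mathbf e_i \wedge \mathbf e_{I \smallsetminus \{i\}}, \mathbf e_I \rangle$ in (4.1), which is equal to $1$ (resp., $-1$) if the number of elements of $I$ 
strictly between $0$ and $i$ is even (resp., odd), was denoted by $(-1)^{\ell(I,i)}$ in [K2]. 

Since $\bigwedge^j(V_0)$ and its orthogonal complement are eigenspaces of $g_t$ with eigenvalues $e^{-\frac{j}{n}t}$ 
and $e^{\frac{n+1-j}{n}t}$ respectively, one can write 

\[ g_t u_{\mathbf y}\mathbf w = e^{-\frac{j}{n}t}\pi(\mathbf w) + e^{\frac{n+1-j}{n}t}\, \mathbf e_0 \wedge \tilde{\mathbf y}\,\mathbf c(\mathbf w); \tag{4.5} \]

thus, up to a uniform constant, 

\[ \|g_t u_{\mathbf f(\cdot)}\Gamma\|_{\mu,B} = \max\bigl( e^{\frac{n+1-j}{n}t}\|\tilde{\mathbf f}(\cdot)\mathbf c(\mathbf w)\|_{\mu,B},\ e^{-\frac{j}{n}t}\|\pi(\mathbf w)\| \bigr), \tag{4.6} \]

where $\Gamma \in \mathcal S_{n+1,j}$ is represented by $\mathbf w$, and we use the notation $\tilde{\mathbf f} \overset{\mathrm{def}}{=} (1, f_1, \dots, f_n)$. 

Observe that from the above one can already extract a nice lower bound for (4.6) whenever 
the restrictions of $1, f_1, \dots, f_n$ to $B \cap \operatorname{supp}\mu$ are linearly independent. Indeed, then the 
correspondence $\mathbf v \mapsto \|\tilde{\mathbf f}(\cdot)\mathbf v\|_{\mu,B}$ yields a norm on $\bigl(\bigwedge(V_0)\bigr)^{n+1}$ which is obviously equivalent 
to the standard (Euclidean) norm. Thus, up to a (uniform in $\mathbf w$) constant, the expression 
(4.6) is bounded from below by $\|\mathbf c(\mathbf w)\|$. Since $\mathbf c$ has trivial kernel and maps $\bigwedge(V_{\mathbb Z})$ into 
$\bigl(\bigwedge(V_{\mathbb Z})\bigr)^{n+1}$, one has $\|\mathbf c(\mathbf w)\| \ge 1$ for any nonzero $\mathbf w \in \bigwedge(V_{\mathbb Z})$. This argument appears in 
[KM] and, in a dual form, in [KLW]. 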

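In general the desired lower bound is affected by the linear dependence relations between 
the components of $\tilde{\mathbf f}$. Namely, denote by $\mathcal F_{\mu,B}$ the $\mathbb R$-linear span of the restrictions of 
$1, f_1, \dots, f_n$ to $B \cap \operatorname{supp}\mu$, denote its dimension by $s + 1$, and choose functions $g_1, \dots, g_s : 
B \cap \operatorname{supp}\mu \to \mathbb R$ such that $1, g_1, \dots, g_s$ form a basis of $\mathcal F_{\mu,B}$. This choice defines a matrix 

\[ R = (r_{i,j})_{\substack{i = 0, \dots, s \\ j = 0, \dots, n}} \]

formed by coefficients in the expansion of $1, f_1, \dots, f_n$ as linear combinations of $1, g_1, \dots, g_s$. 
In other words, with the notation $\tilde{\mathbf g} \overset{\mathrm{def}}{=} (1, g_1, \dots, g_s)$, one has 

\[ \tilde{\mathbf f}(x) = \tilde{\mathbf g}(x) R \quad \forall\, x \in B \cap \operatorname{supp}\mu. \tag{4.7} \]

Moreover, since the first components of $\tilde{\mathbf f}$ and $\tilde{\mathbf g}$ are equal to 1, the elements in the first 
column of $R$ are 

\[ r_{i,0} = \begin{cases} 1 & \text{if } i = 0, \\ 0 & \text{otherwise.} \end{cases} \tag{4.8} \]

In view of (4.7), $\|\tilde{\mathbf f}(\cdot)\mathbf c(\mathbf w)\|_{\mu,B}$ can be replaced by $\|\tilde{\mathbf g}(\cdot)R\mathbf c(\mathbf w)\|_{\mu,B}$, and the latter, in 
view of linear independence of the components of $\tilde{\mathbf g}$, simply by the norms of vectors $R\mathbf c(\mathbf w)$. 
Summarizing the discussion, we see that [3.2-ii] is equivalent to 

\[ \begin{gathered} \forall\, d > c \ \ \exists\, T > 0 \text{ such that } \forall\, t \ge T,\ \forall\, j = 1, \dots, n \text{ and } \forall\, \mathbf w \in \mathcal S_{n+1,j} \\ \text{one has } \max\bigl( e^{\frac{n+1-j}{n}t}\|R\mathbf c(\mathbf w)\|,\ e^{-\frac{j}{n}t}\|\pi(\mathbf w)\| \bigr) \ge e^{-jdt}, \end{gathered} \tag{4.9} \]

where we have identified $\mathcal S_{n+1,j}$ with the set of elements of $\bigwedge^j(\mathbb R^{n+1})$ representing $\Gamma \in \mathcal S_{n+1,j}$. 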

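Here is another way to understand the above condition. Let $\mathbf r_i = (r_{i,0}, \dots, r_{i,n})$ stand for 
the $i$th row of $R$, $i = 0, \dots, s$. Then, using (4.3), one can write the $i$th component of $R\mathbf c(\mathbf w)$ 
in the form 

\[ \begin{aligned} [R\mathbf c(\mathbf w)]_i &= \sum_{k=0}^{n} r_{i,k} \sum_{J} \langle \mathbf e_k \wedge \mathbf e_J, \mathbf w \rangle\, \mathbf e_J \\ &= \sum_{J} \Bigl\langle \Bigl( \sum_{k=0}^{n} r_{i,k}\mathbf e_k \Bigr) \wedge \mathbf e_J, \mathbf w \Bigr\rangle\, \mathbf e_J = \sum_{J} \langle \mathbf r_i \wedge \mathbf e_J, \mathbf w \rangle\, \mathbf e_J, \end{aligned} \tag{4.10} \]

and therefore, up to a uniform constant induced by replacing the Euclidean norm with the 
sup norm, 

\[ \|R\mathbf c(\mathbf w)\| = \max_{i = 0, \dots, s}\ \max_{J \subset \{1, \dots, n\}} |\langle \mathbf r_i \wedge \mathbf e_J, \mathbf w \rangle|. \tag{4.11} \]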

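At this point it becomes useful to recall Lemma 3.1. Namely, for each $j = 1, \dots, n$ 
consider 

\[ E = \bigl\{ (\|R\mathbf c(\mathbf w)\|, \|\pi(\mathbf w)\|) \bigm| \mathbf w \in \mathcal S_{n+1,j} \bigr\}. \]

It is clearly homogeneous with respect to positive integers, and the fact that it is discrete 
follows easily from (4.11) and (4.8). Take $a = \frac{n+1-j}{n}$ and $b = \frac{j}{n}$, and recall that in the beginning 
we fixed $v \ge n$ and chose $c = \frac{v-n}{n(v+1)}$. Then (4.9) amounts to saying that for any $c > c_0 \overset{\mathrm{def}}{=} 
j\,\frac{v-n}{n(v+1)}$ condition [3.1-ii] does not hold. By Lemma 3.1, this is equivalent to saying that 
[3.1-i] does not hold with $v$ replaced by any real number greater than 

\[ \frac{a + c_0}{b - c_0} = \frac{\frac{n+1-j}{n} + j\,\frac{v-n}{n(v+1)}}{\frac{j}{n} - j\,\frac{v-n}{n(v+1)}} = \frac{v + 1 - j}{j}. \]

Therefore (4.9) becomes equivalent to 

\[ \begin{gathered} \forall\, j = 1, \dots, n,\ \forall\, u > \tfrac{v+1-j}{j} \text{ and } \forall\, \mathbf w \in \mathcal S_{n+1,j} \\ \text{with large enough } \|\pi(\mathbf w)\|, \text{ one has } \|R\mathbf c(\mathbf w)\| > \|\pi(\mathbf w)\|^{-u}. \end{gathered} \tag{4.12} \]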

As a result, we managed to get rid of an auxiliary variable t in (4.9) and found a way to 
directly involve v, rather than relate it to c via (3.4). 
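The elimination of $t$ rests on one algebraic identity from Lemma 3.1: with $a = \frac{n+1-j}{n}$, $b = \frac{j}{n}$ and $c_0 = j\,\frac{v-n}{n(v+1)}$ (the values read off from (4.9); this reading of the garbled source is an assumption), the ratio $\frac{a + c_0}{b - c_0}$ collapses to $\frac{v+1-j}{j}$. A minimal exact-arithmetic sketch, with the illustrative name `threshold_from_lemma_3_1`:

```python
from fractions import Fraction


def threshold_from_lemma_3_1(n, j, v):
    """(a + c0) / (b - c0) for a = (n+1-j)/n, b = j/n, c0 = j (v-n) / (n (v+1))."""
    v = Fraction(v)
    a = Fraction(n + 1 - j, n)
    b = Fraction(j, n)
    c0 = j * (v - n) / (n * (v + 1))
    return (a + c0) / (b - c0)
```

For $j = 1$ the threshold is $v$ itself, which matches the first-order condition $\|A\mathbf q + \mathbf p\| < \|\mathbf q\|^{-v}$.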

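Note that the only way the ball $B$, the measure $\mu$ and the map $\mathbf f$ enter the above conditions 
is via the matrix $R$, which depends on both $B \cap \operatorname{supp}\mu$ and $\mathbf f$ and is not uniquely determined; 
but another choice of $R$ would clearly yield a condition equivalent to (4.9) $\iff$ (4.12). Here 
is another useful way to describe $R$. Let 

\[ \mathcal L = \langle \mathbf f(B \cap \operatorname{supp}\mu) \rangle_a, \tag{4.13} \]

put $s = \dim(\mathcal L)$, and suppose 

\[ \mathbf h : \mathbb R^s \to \mathcal L \text{ is an affine isomorphism, and } \tilde{\mathbf h}(\mathbf x) = \tilde{\mathbf x} R, \quad \mathbf x \in \mathbb R^s, \tag{4.14} \]

where as usual we have $\tilde{\mathbf h} \overset{\mathrm{def}}{=} (1, h_1, \dots, h_n)$ and $\tilde{\mathbf x} \overset{\mathrm{def}}{=} (1, x_1, \dots, x_s)$. Then it is clear that 
$R$ and $\mathbf g \overset{\mathrm{def}}{=} \mathbf h^{-1} \circ \mathbf f$ satisfy (4.7), and that $1, g_1, \dots, g_s$ generate $\mathcal F_{\mu,B}$ and are linearly 
independent over $\mathbb R$. This way, condition (4.12) $\iff$ (4.9) $\iff$ [3.2-ii] becomes a property of the 
'enveloping subspace' $\langle \mathbf f(B \cap \operatorname{supp}\mu) \rangle_a$; in particular, $R$ can be chosen uniformly for all 
measures $\mu$, balls $B$ intersecting $\operatorname{supp}\mu$ and maps $\mathbf f$ as long as (4.13) holds. 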
We are now ready for the main result of the section. 

Theorem 4.2. Let $\mu$ be a Federer measure on a Besicovitch metric space $X$, $\mathcal L$ an affine 
subspace of $\mathbb R^n$, and let $\mathbf f : X \to \mathcal L$ be a continuous map such that $(\mathbf f, \mu)$ is good and nonplanar 
in $\mathcal L$. Then the following are equivalent for $v \ge n$: 

[4.2-i] $\{x \in \operatorname{supp}\mu \mid \mathbf f(x) \notin \mathcal W_u\}$ is nonempty for any $u > v$; 
[4.2-ii] $\omega(\mathbf f_*\mu) \le v$ (in other words, each of the sets in [4.2-i] has full measure); 
[4.2-iii] (4.12) holds for some ($\iff$ for any) $R$ satisfying (4.14). 

This was proved in [K2] for $v = n$. 

Proof. Obviously [4.2-ii] $\Rightarrow$ [4.2-i]. Assuming [4.2-iii] and using the facts that $\mu$ is Federer 
and $(\mathbf f, \mu)$ is good, one can conclude that $\mu$-a.e. $x \in X$ has a neighborhood $V$ such that $\mu$ is 
$(C, \alpha)$-good and $D$-Federer on $V$ for some $C, D, \alpha > 0$. Choose a ball $B = B(x, r)$ of positive 
measure such that the dilated ball $\tilde B = B(x, 3^{n+1}r)$ is contained in $V$, and note that (4.13) 
holds in view of (1.3). We have seen in (4.5) that for any $\mathbf w$, each of the coordinates of $g_t u_{\mathbf f}\mathbf w$ 
is expressed as a linear combination of functions $1, f_1, \dots, f_n$. Therefore, in view of Lemma 
1.1, [3.2-i] will hold (perhaps with a different constant $C$), and [3.2-ii] was postulated in the 
disguise of (4.12), as shown by the discussion preceding the statement of the theorem. Thus 
Proposition 3.2 applies, and [4.2-ii] follows. 

On the other hand, saying that [4.2-iii] does not hold, in view of the same discussion, 
amounts to assuming that no ball $B$ intersecting $\operatorname{supp}\mu$ satisfies [3.2-ii]. From Lemma 4.1 
it then follows that $\mathbf f(B \cap \operatorname{supp}\mu) \subset \mathcal W_u$ for some $u > v$, contradicting [4.2-i]. □ 

An especially remarkable feature of the above theorem is that whenever conditions 
[4.2-i,ii] hold for some $X$, $\mu$ and $\mathbf f$ satisfying the assumptions of the theorem, they hold for any 
$X$, $\mu$, $\mathbf f$ satisfying those assumptions. Indeed, condition [4.2-iii], which is equivalent to them, makes no 
reference to $\mathbf f$, $\mu$ or $X$, only to $\mathcal L$. In particular, one can make the most natural choice of 
$X$, $\mu$, $\mathbf f$; that is, put $X = \mathbb R^s$, $\mu = \lambda$ and $\mathbf f = \mathbf h$ as in (4.14), thus establishing Theorems 1.3 
and 0.3, and furthermore producing a formula for the Diophantine exponent of $\mathcal L$. Namely, 
one has 

Corollary 4.3. Let $\mathcal L$ be an $s$-dimensional affine subspace of $\mathbb R^n$. Then 
\[ \omega(\mathcal L) = \max\bigl( n,\ \inf\{ v \mid \text{(4.12) holds for } R \text{ as in (4.14)} \} \bigr). \]

This will be made more explicit in the next section. 

5. Higher Diophantine exponents and Theorem 0.4 

In this section we start by fixing a parametrization (0.6) for an $s$-dimensional affine 
subspace $\mathcal L$ of $\mathbb R^n$. This amounts to taking $R$ of the form 

\[ R = R_A \overset{\mathrm{def}}{=} \begin{pmatrix} I_{s+1} & A \end{pmatrix}, \]

where $A \in M_{s+1,n-s}$. In order to restate condition (4.12) in terms of $A$, let us denote by 
$\mathbf a_i = (a_{i,s+1}, \dots, a_{i,n})$ the $i$th row of $A$, $i = 0, \dots, s$, and identify it with an element of $V$ by 
putting $\mathbf a_i = \sum_{k=s+1}^{n} a_{i,k}\mathbf e_k$. Then, in view of (4.11), we have 

\[ \|R_A\mathbf c(\mathbf w)\| = \max_{i = 0, \dots, s}\ \max_{J \subset \{1, \dots, n\}} |\langle (\mathbf e_i + \mathbf a_i) \wedge \mathbf e_J, \mathbf w \rangle|. \tag{5.1} \]

Corollary 4.3 asserts that for $v \ge n$, the Diophantine exponent of $\mathcal L$ is not greater than $v$ if 
and only if for all $j = 1, \dots, n$, $u > \frac{v+1-j}{j}$ and $\mathbf w \in \mathcal S_{n+1,j}$ with large enough $\|\pi(\mathbf w)\|$, the 
expression in (5.1) exceeds $\|\pi(\mathbf w)\|^{-u}$. 

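This condition however can be considerably simplified. Namely, put 

\[ V_\bullet \overset{\mathrm{def}}{=} \mathbb R\mathbf e_{s+1} \oplus \dots \oplus \mathbb R\mathbf e_n \]

and denote by $\pi_\bullet$ the projection of $\bigwedge(V)$ to $\bigwedge(V_\bullet)$. We will also be using the following 
notation: $x \ll y$ will stand for $x \le Cy$, where $C$ depends only on the matrix $A$ and not on 
$\mathbf w$. 

Lemma 5.1. Suppose that $\|R_A\mathbf c(\mathbf w)\|$ is less than 1 for some $\mathbf w \in \mathcal S_{n+1,j}$. Then $\|\mathbf w\| \ll 
1 + \|\pi_\bullet(\mathbf w)\|$. 

Proof. Let us take $I \subset \{0, \dots, n\}$ of size $j$ and prove that the absolute value of $\langle \mathbf e_I, \mathbf w \rangle$ is 
bounded from above by a uniform constant times $1 + \|\pi_\bullet(\mathbf w)\|$. Denote by $k$ the smallest element 
of $I$. The claim is trivial if $k > s$. Otherwise, using (5.1), one can write 

\[ |\langle \mathbf e_I, \mathbf w \rangle| \le 1 + |\langle \mathbf a_k \wedge \mathbf e_{I \smallsetminus \{k\}}, \mathbf w \rangle| \]

and observe that, since $\mathbf a_i \in V_\bullet$ for each $i$, the right hand side is not greater than 
$1 + \max_i \|\mathbf a_i\| \cdot \max_J |\langle \mathbf e_J, \mathbf w \rangle|$, where $J$ runs over the sets obtained from $I$ by replacing $k$ 
with an element of $\{s+1, \dots, n\}$. The same argument can be applied to each of the 
components $\langle \mathbf e_J, \mathbf w \rangle$, and after no more than $s$ additional steps the process will terminate. □ 

In particular, the lemma forces $\|R_A\mathbf c(\mathbf w)\|$ to be not less than 1 for any $\mathbf w \in \mathcal S_{n+1,j}$ with 
$j > n - s$ and large enough $\|\pi(\mathbf w)\|$ (this was checked in [K2, Lemmas 4.5 and 4.6] for some 
special cases). Since $1 > \|\pi(\mathbf w)\|^{-u}$ whenever $u > \frac{v+1-j}{j}$ and $v$ is at least $j - 1$, we can 
conclude that subgroups of rank greater than $n - s$ have no impact on the Diophantine 
exponent of $\mathcal L$. 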

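It will be convenient to associate to $A$ the following quantities: for each $j = 1, \dots, n - s$, 
define 

\[ \omega_j(A) \overset{\mathrm{def}}{=} \sup\left\{ v \ \middle|\ \begin{aligned} &\exists\, \mathbf w \in \mathcal S_{n+1,j} \text{ with arbitrary large } \|\pi_\bullet(\mathbf w)\| \\ &\text{such that } \|R_A\mathbf c(\mathbf w)\| < \|\pi_\bullet(\mathbf w)\|^{-\frac{v+1-j}{j}} \end{aligned} \right\}. \tag{5.2} \]

By Lemma 5.1, (4.12) holds if and only if $\omega_j(A) \le v$ for all $j = 1, \dots, n - s$. Thus Corollary 
4.3 can be rewritten as 

Corollary 5.2. For $\mathcal L$ parametrized by (0.6), $\omega(\mathcal L) = \max\bigl( n,\ \max_{j = 1, \dots, n-s} \omega_j(A) \bigr)$. 

We will refer to $\omega_j(A)$ as the Diophantine exponent of $A$ of order $j$. The reason for this 
terminology is the observation, essentially made in [K2], that 

Lemma 5.3. $\omega_1(A) = \omega(A)$. 

Proof. Take $\mathbf v = \begin{pmatrix} \mathbf p \\ \mathbf q \end{pmatrix} \in \mathbb Z^{n+1} \smallsetminus \{0\} = \mathcal S_{n+1,1}$ in place of $\mathbf w$, where $\mathbf p \in \mathbb Z^{s+1}$ and $\mathbf q \in \mathbb Z^{n-s}$, 
and observe that the only possible choice of $J$ in (5.1) and (4.3) is $J = \varnothing$. It follows that 
$\mathbf c(\mathbf v) = \mathbf v$, $R_A\mathbf c(\mathbf v) = \mathbf p + A\mathbf q$ and $\pi_\bullet(\mathbf v) = \mathbf q$; hence the inequality in (5.2) coincides with 
(0.5). □ 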
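In the first-order case the quantity to be made small is $\|A\mathbf q + \mathbf p\|$ with $\mathbf p$ ranging over integer vectors, and the optimal $\mathbf p$ is obtained by rounding each coordinate of $A\mathbf q$. The following minimal sketch computes it; the name `best_integer_shift` is illustrative, and the sup norm is used instead of the Euclidean one purely for simplicity (an assumption that does not change exponents).

```python
def best_integer_shift(A, q):
    """Return (p, err), where the integer vector p minimises the sup norm of A q + p."""
    Aq = [sum(a * x for a, x in zip(row, q)) for row in A]
    p = [-round(t) for t in Aq]  # nearest integer to each coordinate, negated
    err = max(abs(t + s) for t, s in zip(Aq, p))
    return p, err
```

Comparing `err` with a negative power of the norm of `q` over many `q` gives a crude experimental handle on $\omega(A)$, though no finite computation determines the exponent.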

In view of the lemma, the estimate (0.7≥) stated in the Introduction gets to be a special 
case of Corollary 5.2 corresponding to $j = 1$. However it is worthwhile to note that this 
inequality can be proved in an elementary way, and even more can be said: 

Lemma 5.4. Let $\mathcal L$ be parametrized as in (0.6). Then for any $u < \omega(A)$ there exists an 
infinite subset $\Lambda$ of $\mathbb Z^{n+1}$ such that 

\[ |\mathbf{yq} + p| < \|\mathbf q\|^{-u} \quad \text{for all } \mathbf y \in \mathcal L \text{ and all but finitely many } (p, \mathbf q) \in \Lambda. \tag{5.3} \]

This not only proves the lower bound (0.7≥), but also provides a way to approximate all 
points of $\mathcal L$ uniformly by a fixed sequence of integers. 

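Proof. One knows that for any $v < \omega(A)$ and infinitely many $\mathbf q \in \mathbb Z^{n-s}$ one can find 
$\mathbf p = (p_0, p_1, \dots, p_s) \in \mathbb Z^{s+1}$ satisfying (0.5). Now take any $\mathbf x \in \mathbb R^s$, denote $(p_1, \dots, p_s)$ by $\mathbf p'$ 
and write 

\[ \left| p_0 + (\mathbf x, \tilde{\mathbf x}A) \begin{pmatrix} \mathbf p' \\ \mathbf q \end{pmatrix} \right| = |p_0 + \mathbf x\mathbf p' + \tilde{\mathbf x}A\mathbf q| = |\tilde{\mathbf x}(A\mathbf q + \mathbf p)| \le \|\tilde{\mathbf x}\|\, \|A\mathbf q + \mathbf p\|. \]

Slightly decreasing $v$ if needed, one gets 

\[ \left| p_0 + (\mathbf x, \tilde{\mathbf x}A) \begin{pmatrix} \mathbf p' \\ \mathbf q \end{pmatrix} \right| < \|\mathbf q\|^{-v} \tag{5.4} \]

for all but finitely many $(\mathbf p, \mathbf q)$ as above. Then it easily follows from (0.5) that $\|\mathbf p\|$ is 
bounded from above by $C\|\mathbf q\|$, where $C$ depends only on $A$. Thus, after possibly another 
slight change of $v$ and throwing away another finite subset, one can put $\left\| \begin{pmatrix} \mathbf p' \\ \mathbf q \end{pmatrix} \right\|^{-v}$ in the 
right hand side of (5.4). □ 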

We now turn to the equality cases in (0.7≥), that is, to Theorem 0.4. The first observation 
is that from Lemma 5.3 and Corollary 5.2 one can immediately deduce (0.7=) for $s = n - 1$, 
that is, for column matrices $\iff$ codimension one subspaces of $\mathbb R^n$. Since both $\omega(\mathcal L)$ and $\omega(A)$ 
are obviously infinite if columns of $A$ are linearly dependent over $\mathbb Q$, this proves Theorem 
0.4 in case (b). 

Another case when Theorem 0.4 holds for trivial reasons is $\mathcal L = \{\mathbf a\}$, a zero-dimensional 
subspace represented by a $1 \times n$ matrix $\mathbf a \in \mathbb R^n$. Indeed, it is a tautological statement that 
$\omega(\mathcal L)$, that is, the Diophantine exponent of the $\delta$-measure supported at $\mathbf a$, is equal to $\omega(\mathbf a)$. 
On the other hand everything done in §4 is easily applicable in the case $s = 0$ (and $\tilde{\mathbf x} = 1$). 
Thus it follows from Corollary 5.2 that $\omega_j(\mathbf a) \le \omega(\mathbf a)$ for each $j$. (Exercise: prove it directly 
from the definition (5.2).) 

To finish the proof of Theorem 0.4 it remains to treat the case when A is a matrix with 
rationally proportional rows. For that it will be useful to get a better understanding of the 
'hidden symmetries' of higher order exponents. The three lemmas below serve this purpose. 

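Lemma 5.5. For any $A \in M_{s+1,n-s}$ and all $\mathbf w \in \mathcal S_{n+1,j}$, $2 \le j \le n - s$, one has 

\[ \max_{i = 0, \dots, s}\ \max_{J \subset \{0, \dots, n\}} |\langle (\mathbf e_i + \mathbf a_i) \wedge \mathbf e_J, \mathbf w \rangle| \ll \|R_A\mathbf c(\mathbf w)\| \tag{5.5a} \]

and 

\[ \|R_A\mathbf c(\mathbf w)\| \ll \max_{i = 0, \dots, s}\ \max_{J \subset \{i+1, \dots, n\}} |\langle (\mathbf e_i + \mathbf a_i) \wedge \mathbf e_J, \mathbf w \rangle|. \tag{5.5b} \]

Note that in both cases the reverse inequalities are obvious. The statement is also obvious 
for $j = 1$ (since, as was mentioned before, the only possible choice of $J$ is $J = \varnothing$). Equation 
(5.5b) allows one to slightly reduce the set of pairs $(i, J)$ involved in computations of 
Diophantine exponents. Also, (5.5a) says that one can enlarge the set of pairs in (5.1) so that 
the formula, and hence the definition of $\omega_j(A)$, become symmetric under any² permutation 
of rows of $A$. 

Proof. A crucial observation is the following: for any $0 \le k \le s$, using (5.1) and the linearity 
of $\mathbf u \mapsto \langle (\mathbf e_k + \mathbf a_k) \wedge \mathbf u, \mathbf w \rangle$, one can write 

\[ |\langle (\mathbf e_k + \mathbf a_k) \wedge \mathbf u, \mathbf w \rangle| \ll \|R_A\mathbf c(\mathbf w)\| \tag{5.6a} \]

whenever $\mathbf u \in \bigwedge^{j-1}(V_0)$ has norm $\ll \|A\|$. Likewise, assuming in addition that $\mathbf u \in 
\bigwedge^{j-1}(\mathbb R\mathbf e_{k+1} \oplus \dots \oplus \mathbb R\mathbf e_n)$, one has 

\[ |\langle (\mathbf e_k + \mathbf a_k) \wedge \mathbf u, \mathbf w \rangle| \ll \max_{J \subset \{k+1, \dots, n\}} |\langle (\mathbf e_k + \mathbf a_k) \wedge \mathbf e_J, \mathbf w \rangle|. \tag{5.6b} \]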


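²The invariance of $\omega_j(A)$ under permutations not involving the top row is an immediate consequence of 
(5.1); however the fact that the top row can also be permuted is nontrivial. 

To prove (5.5a), take $I \subset \{1, \dots, n\}$ of cardinality $j - 2$, and for any $0 \le k \le s$ write 

\[ \begin{aligned} |\langle (\mathbf e_k + \mathbf a_k) \wedge \mathbf e_0 \wedge \mathbf e_I, \mathbf w \rangle| &= |\langle \mathbf e_0 \wedge (\mathbf e_k + \mathbf a_k) \wedge \mathbf e_I, \mathbf w \rangle| \\ &\ll \|R_A\mathbf c(\mathbf w)\| + |\langle \mathbf a_0 \wedge (\mathbf e_k + \mathbf a_k) \wedge \mathbf e_I, \mathbf w \rangle| && \text{((5.6a) with } \mathbf u = (\mathbf e_k + \mathbf a_k) \wedge \mathbf e_I\text{)} \\ &= \|R_A\mathbf c(\mathbf w)\| + |\langle (\mathbf e_k + \mathbf a_k) \wedge \mathbf a_0 \wedge \mathbf e_I, \mathbf w \rangle| \\ &\ll \|R_A\mathbf c(\mathbf w)\| && \text{((5.6a) with } \mathbf u = \mathbf a_0 \wedge \mathbf e_I\text{)}. \end{aligned} \]

Similarly, if $k < i$ and $I \subset \{k+1, \dots, n\}$ one has 

\[ \begin{aligned} |\langle (\mathbf e_i + \mathbf a_i) \wedge \mathbf e_k \wedge \mathbf e_I, \mathbf w \rangle| &= |\langle \mathbf e_k \wedge (\mathbf e_i + \mathbf a_i) \wedge \mathbf e_I, \mathbf w \rangle| \\ &\ll \max_{J \subset \{k+1, \dots, n\}} |\langle (\mathbf e_k + \mathbf a_k) \wedge \mathbf e_J, \mathbf w \rangle| + |\langle \mathbf a_k \wedge (\mathbf e_i + \mathbf a_i) \wedge \mathbf e_I, \mathbf w \rangle| && \text{((5.6b) with } \mathbf u = (\mathbf e_i + \mathbf a_i) \wedge \mathbf e_I\text{)} \\ &\ll \max_{J \subset \{k+1, \dots, n\}} \max\bigl( |\langle (\mathbf e_k + \mathbf a_k) \wedge \mathbf e_J, \mathbf w \rangle|,\ |\langle (\mathbf e_i + \mathbf a_i) \wedge \mathbf e_J, \mathbf w \rangle| \bigr), \end{aligned} \]

and a repeated application of this trick allows one to reduce the estimation of $\|R_A\mathbf c(\mathbf w)\|$ to 
$J \subset \{i + 1, \dots, n\}$, proving (5.5b). □ 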

The invariance of $\omega_j(A)$ with respect to permutations of rows of $A$ suggests that the same 
might hold for other row operations. This indeed happens to be the case: 

Lemma 5.6. Let $A' = BA$ for some $B \in \mathrm{GL}_{s+1}(\mathbb Q)$; in other words, $A'$ can be obtained 
from $A$ by a sequence of elementary row operations with rational coefficients (that is, 
transposition of rows, addition of one row to another, and multiplication of a row by a nonzero 
rational number). Then $\omega_j(A') = \omega_j(A)$ for all $j$. 

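Proof. Since row interchanges are taken care of in view of the previous lemma, it remains 
to prove 

\[ \omega_j(A') \ge \omega_j(A) \tag{5.7} \]

assuming all the rows of $A'$ are the same as those of $A$ except for the top row, and the latter 
is equal to: (a) $\frac{k}{\ell}\mathbf a_0$, where $k$ and $\ell$ are nonzero integers, and (b) $\mathbf a_0 + \mathbf a_1$. 

For case (a), take $\mathbf w \in \mathcal S_{n+1,j}$, write it in the form $\mathbf w = \mathbf w_0 + \mathbf e_0 \wedge \mathbf w'$ where both $\mathbf w_0$ and 
$\mathbf w'$ are in $\bigwedge(V_0)$, and put 

\[ \tilde{\mathbf w} = \ell\mathbf w_0 + k\,\mathbf e_0 \wedge \mathbf w'. \]

It is easy to see that $\tilde{\mathbf w}$ also belongs to $\mathcal S_{n+1,j}$, that is, represents a subgroup of $\mathbb Z^{n+1}$: one 
can write 

\[ \mathbf w = (a\mathbf e_0 + b\mathbf v_1) \wedge \mathbf v_2 \wedge \dots \wedge \mathbf v_j \]

for some integer vectors $\mathbf v_1, \dots, \mathbf v_j \in V_0$ and $a, b \in \mathbb Z$, so that $\mathbf w_0 = b\,\mathbf v_1 \wedge \dots \wedge \mathbf v_j$ and 
$\mathbf w' = a\,\mathbf v_2 \wedge \dots \wedge \mathbf v_j$, and then take 

\[ \tilde{\mathbf w} = (ka\mathbf e_0 + \ell b\mathbf v_1) \wedge \mathbf v_2 \wedge \dots \wedge \mathbf v_j. \]

Now we claim that $\|R_{A'}\mathbf c(\tilde{\mathbf w})\|$ is not bigger than $\max(|k|, |\ell|)$ times $\|R_A\mathbf c(\mathbf w)\|$. Indeed, 
one has, for $J \subset \{1, \dots, n\}$, 

\[ \begin{aligned} \bigl\langle (\mathbf e_0 + \tfrac{k}{\ell}\mathbf a_0) \wedge \mathbf e_J, \tilde{\mathbf w} \bigr\rangle &= \langle \mathbf e_0 \wedge \mathbf e_J, k\,\mathbf e_0 \wedge \mathbf w' \rangle + \bigl\langle \tfrac{k}{\ell}\mathbf a_0 \wedge \mathbf e_J, \ell\mathbf w_0 \bigr\rangle \\ &= k\bigl( \langle \mathbf e_J, \mathbf w' \rangle + \langle \mathbf a_0 \wedge \mathbf e_J, \mathbf w_0 \rangle \bigr) = k\, \langle (\mathbf e_0 + \mathbf a_0) \wedge \mathbf e_J, \mathbf w \rangle, \end{aligned} \]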

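and also, for $i \ge 1$, 

\[ \langle (\mathbf e_i + \mathbf a_i) \wedge \mathbf e_J, \tilde{\mathbf w} \rangle = \langle (\mathbf e_i + \mathbf a_i) \wedge \mathbf e_J, \ell\mathbf w_0 \rangle = \ell\, \langle (\mathbf e_i + \mathbf a_i) \wedge \mathbf e_J, \mathbf w_0 \rangle = \ell\, \langle (\mathbf e_i + \mathbf a_i) \wedge \mathbf e_J, \mathbf w \rangle. \]

This clearly implies (5.7). In case (b) the argument is similar. Namely, we take $\mathbf w \in \mathcal S_{n+1,j}$, 
write it in the form 

\[ \mathbf w = \mathbf w_0 + \mathbf e_0 \wedge \mathbf w_0' + \mathbf e_1 \wedge \mathbf w_1' + \mathbf e_0 \wedge \mathbf e_1 \wedge \mathbf w', \]

where $\mathbf w_0$, $\mathbf w_0'$, $\mathbf w_1'$, $\mathbf w'$ are all in $\bigwedge(\mathbb R\mathbf e_2 \oplus \dots \oplus \mathbb R\mathbf e_n)$, and put 

\[ \tilde{\mathbf w} = \mathbf w + \mathbf e_0 \wedge \mathbf w_1' = \mathbf w_0 + \mathbf e_0 \wedge (\mathbf w_0' + \mathbf w_1') + \mathbf e_1 \wedge \mathbf w_1' + \mathbf e_0 \wedge \mathbf e_1 \wedge \mathbf w'. \]

Again, $\tilde{\mathbf w}$ can be easily shown to represent a subgroup of $\mathbb Z^{n+1}$: write 

\[ \mathbf w = (a\mathbf e_0 + b\mathbf e_1 + \mathbf v_1) \wedge (c\mathbf e_1 + \mathbf v_2) \wedge \mathbf v_3 \wedge \dots \wedge \mathbf v_j \]

for some integer vectors $\mathbf v_1, \dots, \mathbf v_j \in \mathbb R\mathbf e_2 \oplus \dots \oplus \mathbb R\mathbf e_n$ and $a, b, c \in \mathbb Z$, and then take 

\[ \tilde{\mathbf w} = \bigl(a\mathbf e_0 + b(\mathbf e_0 + \mathbf e_1) + \mathbf v_1\bigr) \wedge \bigl(c(\mathbf e_0 + \mathbf e_1) + \mathbf v_2\bigr) \wedge \mathbf v_3 \wedge \dots \wedge \mathbf v_j. \]

Now let us estimate $\|R_{A'}\mathbf c(\tilde{\mathbf w})\|$. It is clear that $\langle (\mathbf e_i + \mathbf a_i) \wedge \mathbf e_J, \tilde{\mathbf w} \rangle$ is the same as 
$\langle (\mathbf e_i + \mathbf a_i) \wedge \mathbf e_J, \mathbf w \rangle$ for $i \ge 1$ and $J \subset \{1, \dots, n\}$. On the other hand, 

\[ \begin{aligned} \langle (\mathbf e_0 + \mathbf a_0 + \mathbf a_1) \wedge \mathbf e_J, \tilde{\mathbf w} \rangle &= \langle \mathbf e_0 \wedge \mathbf e_J, \mathbf w + \mathbf e_0 \wedge \mathbf w_1' \rangle + \langle \mathbf a_0 \wedge \mathbf e_J, \mathbf w \rangle + \langle \mathbf a_1 \wedge \mathbf e_J, \mathbf w \rangle \\ &= \langle (\mathbf e_0 + \mathbf a_0) \wedge \mathbf e_J, \mathbf w \rangle + \langle \mathbf e_J, \mathbf w_1' \rangle + \langle \mathbf a_1 \wedge \mathbf e_J, \mathbf w \rangle \\ &= \langle (\mathbf e_0 + \mathbf a_0) \wedge \mathbf e_J, \mathbf w \rangle + \langle (\mathbf e_1 + \mathbf a_1) \wedge \mathbf e_J, \mathbf w \rangle. \end{aligned} \]

Therefore $\|R_{A'}\mathbf c(\tilde{\mathbf w})\| \le 2\|R_A\mathbf c(\mathbf w)\|$, so (5.7) holds as well. □ 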

Lemma 5.7. Suppose that $A$ has more than one row, and let $A'$ be the matrix obtained 
from $A$ by removing one of its rows. Then $\omega_j(A') \ge \omega_j(A)$ for all $j$. If in addition the 
removed row is a rational linear combination of the remaining rows, then $\omega_j(A') = \omega_j(A)$ 
for all $j$. 

Proof. Let $A'$ be obtained from $A$ by removing its top row (this can be assumed without 
loss of generality in view of the row interchange invariance). It is clear that for any $\mathbf w \in 
\mathcal S_{n+1,j}$ the inner product $\langle (\mathbf e_i + \mathbf a_i) \wedge \mathbf e_J, \mathbf w \rangle$ coincides with $\langle (\mathbf e_i + \mathbf a_i) \wedge \mathbf e_J, \pi(\mathbf w) \rangle$ whenever 
$i > 0$ and $J \subset \{1, \dots, n\}$. Thus it follows from (5.1) that $\|R_{A'}\mathbf c(\pi(\mathbf w))\|$ is not greater 
than $\|R_A\mathbf c(\mathbf w)\|$, and obviously $\pi(\mathbf w) \in \mathcal S_{n,j}$ and $\pi_\bullet(\pi(\mathbf w)) = \pi_\bullet(\mathbf w)$. Therefore whenever 
$\mathbf w \in \mathcal S_{n+1,j}$ produces 'a good approximation' to $A$ (meaning that $\|R_A\mathbf c(\mathbf w)\|$ is smaller than 
$\|\pi_\bullet(\mathbf w)\|$ to some negative power), its projection $\pi(\mathbf w)$ onto $\bigwedge^j(V_0)$ yields an equally good 
(or better) approximation to $A'$. This proves the first part of the lemma. 

For the second part, in view of Lemma 5.6 it is enough to assume that all the coefficients 
in the linear combination are zero, that is, $\mathbf a_0 = 0$. Then one can reverse the above argument: 
whenever $\mathbf w \in \bigwedge^j(V_0)$ produces 'a good approximation' to $A'$, it automatically yields an 
equally good approximation to $A$, since one has $\langle (\mathbf e_0 + \mathbf a_0) \wedge \mathbf e_J, \mathbf w \rangle = \langle \mathbf e_0 \wedge \mathbf e_J, \mathbf w \rangle = 0$. □ 

A combination of the above lemma with the observation made before Lemma 5.5 
completes the proof of Theorem 0.4. In particular, we have shown that subspaces $\mathcal L$ of the 
form 

\[ \mathcal L = \{ (x_1, \dots, x_s, a_1, \dots, a_{n-s}) \} \quad \text{or} \quad \mathcal L = \{ (x_1, \dots, x_s, a_1 x_1, \dots, a_{n-s} x_1) \} \]

satisfy (0.7=), thus generalizing [K2, Lemma 4.7] (whose method of proof, borrowed from 
[BBKM], did not shed any light on higher Diophantine exponents of the corresponding 
matrices). Furthermore, in view of Lemma 5.4 one can conclude that whenever a subspace 
satisfying the assumptions of Theorem 0.4 is not extremal (equivalently, $\omega(A) > n$), for any 
$u < \omega(\mathcal L) = \omega(A)$ one can find an infinite $\Lambda \subset \mathbb Z^{n+1}$ such that (5.3) holds; that is, there 
exists an infinite supply of approximating vectors which can work uniformly for all points 
of $\mathcal L$. 

6. Generalizations and open questions 



6.1. For matrices with no rational dependence between rows or columns the exponents of 
orders higher than 1 seem to be hard to understand. In particular the following question, a 
special case of which was asked in [K2], appears to be interesting: 

Question. Does there exist a matrix $A \in M_{s+1,n-s}$ and $2 \le j \le n - s$ such that $\omega_j(A)$ is 
greater than both $n$ and $\omega(A)$? 

An affirmative answer to this question would give a counterexample to (0.7=), and, 
moreover, would provide an example of a proper nonextremal affine subspace $\mathcal L$ of $\mathbb R^n$ such 
that for some $v < \omega(\mathcal L)$ it is impossible to find an infinite subset $\Lambda$ of $\mathbb Z^{n+1}$ such that 
$|\mathbf{yq} + p| < \|\mathbf q\|^{-v}$ for all $\mathbf y \in \mathcal L$ and all but finitely many $(p, \mathbf q) \in \Lambda$. On the other hand, a 
useful consequence of the validity of (0.7=) for all $\mathcal L$ would be a possibility to compute the 
Hausdorff dimension of the set of subspaces of a given dimension and Diophantine exponent: 
indeed, then, in view of [Do], one would have 

\[ \dim\bigl( \{ \mathcal L \subset \mathbb R^n \mid \dim(\mathcal L) = s,\ \omega(\mathcal L) = v \} \bigr) = \begin{cases} (s+1)(n-s-1) + \frac{n+1}{v+1} & \text{if } v > n, \\ (s+1)(n-s) & \text{otherwise.} \end{cases} \tag{6.1} \]

By Theorem 0.4 this holds unconditionally for codimension one subspaces, that is, 

\[ \dim\bigl( \{ \mathcal L \subset \mathbb R^n \mid \dim(\mathcal L) = n - 1,\ \omega(\mathcal L) = v \} \bigr) = \begin{cases} \frac{n+1}{v+1} & \text{if } v > n, \\ n & \text{otherwise.} \end{cases} \]

The author unfoundedly suspects (6.1) to be true regardless of the answer to the above 
question; in other words, even if higher order Diophantine exponents can interfere with 
(0.7=), they conjecturally should not be powerful enough to affect the computation of the 
Hausdorff dimension. 
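The right hand side of (6.1) is easy to tabulate. The sketch below simply encodes that formula; the name `conjectural_dimension` is illustrative, the case split follows the reading of (6.1) given above (an assumption about the garbled source), and outside codimension one the value is conditional on (0.7=).

```python
from fractions import Fraction


def conjectural_dimension(n, s, v):
    """Right-hand side of (6.1) for s-dimensional subspaces of R^n with exponent v >= n."""
    if v < n:
        raise ValueError("omega(L) is never smaller than n")
    if v > n:
        return (s + 1) * (n - s - 1) + Fraction(n + 1) / (Fraction(v) + 1)
    return Fraction((s + 1) * (n - s))
```

For $s = n - 1$ the first term vanishes and the formula reduces to $\frac{n+1}{v+1}$, the unconditional codimension one case.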

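6.2. The simplest matrices to look for possible counterexamples to (0.7=) would be of size 
$2 \times 2$, corresponding to lines in $\mathbb R^3$. To convince the reader that the problem is far from 
trivial, let us work out an explicit formula for the second order Diophantine exponent of 
$A = \begin{pmatrix} a_{02} & a_{03} \\ a_{12} & a_{13} \end{pmatrix}$. Write $\mathbf w \in \mathcal S_{4,2}$ in the form 

\[ \mathbf w = p\,\mathbf e_0 \wedge \mathbf e_1 + \sum_{\substack{i = 0, 1 \\ j = 2, 3}} w_{ij}\, \mathbf e_i \wedge \mathbf e_j + q\,\mathbf e_2 \wedge \mathbf e_3, \tag{6.2} \]

and observe that for $i = 0, 1$ one has 

\[ \begin{aligned} \langle (\mathbf e_i + \mathbf a_i) \wedge \mathbf e_2, \mathbf w \rangle &= w_{i2} - a_{i3}q, \\ \langle (\mathbf e_i + \mathbf a_i) \wedge \mathbf e_3, \mathbf w \rangle &= w_{i3} + a_{i2}q, \end{aligned} \tag{6.3} \]

and also 

\[ \begin{aligned} \langle (\mathbf e_0 + \mathbf a_0) \wedge \mathbf e_1, \mathbf w \rangle &= p - a_{02}w_{12} - a_{03}w_{13} \\ &= p - \det(A)q - a_{02}(w_{12} - a_{13}q) - a_{03}(w_{13} + a_{12}q). \end{aligned} \tag{6.4} \]

Combining (6.3) and (6.4), one concludes that the definition (5.2) of $\omega_2(A)$ reduces to the 
following: $\omega_2(A)$ is the supremum of $v$ for which there exist $\mathbf w \in \mathcal S_{4,2}$ of the form (6.2) with 
arbitrary large $|q|$ such that 

\[ \max\bigl( |w_{02} - a_{03}q|,\ |w_{12} - a_{13}q|,\ |w_{03} + a_{02}q|,\ |w_{13} + a_{12}q|,\ |p - \det(A)q| \bigr) < |q|^{-\frac{v-1}{2}}. \]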
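The regrouping in (6.4) is a purely algebraic identity in $p$, $q$ and the $w_{ij}$, so it can be checked with exact arithmetic. The following is a minimal sketch; the function names are illustrative, `A` is passed as `((a02, a03), (a12, a13))`, and `w` is a dictionary keyed by `(i, j)`. The check does not impose the Plücker relation that membership in $\mathcal S_{4,2}$ requires, since the identity holds without it.

```python
from fractions import Fraction
from itertools import product


def lhs_6_4(A, p, w):
    """p - a02 w12 - a03 w13, the first expression in (6.4)."""
    (a02, a03), _ = A
    return p - a02 * w[1, 2] - a03 * w[1, 3]


def rhs_6_4(A, p, w, q):
    """p - det(A) q - a02 (w12 - a13 q) - a03 (w13 + a12 q), the second expression."""
    (a02, a03), (a12, a13) = A
    det_A = a02 * a13 - a03 * a12
    return p - det_A * q - a02 * (w[1, 2] - a13 * q) - a03 * (w[1, 3] + a12 * q)


def second_order_quantities(A, p, w, q):
    """The five quantities whose maximum enters the reduced definition of omega_2(A)."""
    (a02, a03), (a12, a13) = A
    det_A = a02 * a13 - a03 * a12
    return (w[0, 2] - a03 * q, w[1, 2] - a13 * q,
            w[0, 3] + a02 * q, w[1, 3] + a12 * q,
            p - det_A * q)
```

The identity explains why $|p - \det(A)q|$ can replace the left hand side of (6.4) in the maximum: the two differ by a bounded combination of quantities already listed in (6.3).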

Even when $\det(A) = 0$ (that is, rows/columns of $A$ are linearly dependent over $\mathbb R$ but not 
over $\mathbb Q$), the situation does not seem to be any less complicated. 

6.3. Theorem 2.2 can be used to treat the so-called multiplicative versions of the 
Diophantine problems discussed in this paper. Namely, define 

\[ \Pi_+(\mathbf q) \overset{\mathrm{def}}{=} \prod_{i=1}^{n} \max(|q_i|, 1), \]

denote by $\mathcal W_v^\times$ the set of $\mathbf y \in \mathbb R^n$ for which there are infinitely many $\mathbf q \in \mathbb Z^n$ such that 

\[ |\mathbf{yq} + p| < \Pi_+(\mathbf q)^{-v/n} \]

for some $p \in \mathbb Z$, and then define multiplicative Diophantine exponents: 

• $\omega^\times(\mathbf y)$ of $\mathbf y$ by $\omega^\times(\mathbf y) \overset{\mathrm{def}}{=} \sup\{ v \mid \mathbf y \in \mathcal W_v^\times \}$, 

• $\omega^\times(\mu)$ of $\mu$ by $\omega^\times(\mu) \overset{\mathrm{def}}{=} \sup\{ v \mid \mu(\mathcal W_v^\times) > 0 \}$. 

It is easy to see that $\omega^\times(\mathbf y)$ is not less than $\omega(\mathbf y)$ for all $\mathbf y$, and $\omega^\times(\mathbf y) = n$ for $\lambda$-a.e. 
$\mathbf y \in \mathbb R^n$, that is, $\omega^\times(\lambda) = n$. To adapt the methods of the present paper to this set-up, one 
needs, following [KM] and [K2, §5], to replace the one-parameter flow (3.2) by the action of 
the multi-parameter semigroup 

\[ \operatorname{diag}(e^{t}, e^{-t_1}, \dots, e^{-t_n}), \quad \text{where } t_i \ge 0 \text{ and } t = t_1 + \dots + t_n. \]

This way it should be possible to prove Theorem 0.3 with $\omega$ replaced by $\omega^\times$ (this was done 
in [K2] under the assumption that $\mathcal L$ is strongly extremal, that is, $\omega^\times(\mathcal L) = n$), and also 
derive formulas for multiplicative Diophantine exponents of affine subspaces. 
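The inequality $\omega^\times(\mathbf y) \ge \omega(\mathbf y)$ comes from the elementary bound $\Pi_+(\mathbf q) \le \|\mathbf q\|^n$ for nonzero integer $\mathbf q$, which makes the multiplicative target $\Pi_+(\mathbf q)^{-v/n}$ at least as large as $\|\mathbf q\|^{-v}$. A minimal sketch, assuming the product of $\max(|q_i|, 1)$ as the definition of $\Pi_+$ and using the sup norm for the comparison; the name `pi_plus` is illustrative.

```python
import math


def pi_plus(q):
    """Product of max(|q_i|, 1) over the coordinates of an integer vector q."""
    return math.prod(max(abs(x), 1) for x in q)


def sup_norm(q):
    """Largest absolute value of a coordinate of q."""
    return max(abs(x) for x in q)
```

Since the sup norm is bounded by the Euclidean norm, the same comparison holds for the Euclidean norm used in the text.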

6.4. With some abuse of notation, let us denote by $\sigma(\mathbf y)$ the Diophantine exponent of the 
column matrix given by the vector $\mathbf y \in \mathbb R^n$, that is, 

\[ \sigma(\mathbf y) \overset{\mathrm{def}}{=} \sup\{ v \mid \exists\, \infty\text{-many } q \in \mathbb Z \text{ with } \|q\mathbf y + \mathbf p\| < |q|^{-v} \text{ for some } \mathbf p \in \mathbb Z^n \}, \]

and, similarly to what was done with $\omega$, extend it to measures on $\mathbb R^n$. It follows from 
Khintchine's Transference Principle that $\omega(\mathbf y) = n$ if and only if $\sigma(\mathbf y)$ attains its smallest 
possible value, i.e. is equal to $1/n$. Therefore in all the problems related to the extremality of 
manifolds/measures, it makes no difference whether to interpret points of $\mathbb R^n$ as row vectors 
(linear forms) or as column vectors (the latter set-up was employed in [KLW, KW1-2] and 
is usually referred to as simultaneous approximation, hence our choice of notation $\sigma$). The 
situation is however different when Diophantine exponents are bounded away from their 
critical values. Indeed, [C, Ch. V, Theorem IV] estimates $\sigma(\mathbf y)$ in terms of $\omega(\mathbf y)$ as follows: 

\[ \frac{\omega(\mathbf y) - n + 1}{n} \ge \sigma(\mathbf y) \ge \frac{1}{n - 1 + n/\omega(\mathbf y)}, \]

and the above inequalities are known to be sharp. Thus in general it is not possible to 
extract any information concerning $\sigma(\mu)$ when $\omega(\mu)$ is known. 
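The width of the transference window can be made concrete. The sketch below assumes the standard form of Khintchine's transference inequalities, $\frac{1}{n - 1 + n/\omega} \le \sigma \le \frac{\omega - n + 1}{n}$, in the normalization used here (generic values $\omega = n$, $\sigma = 1/n$); the name `transference_bounds` is illustrative.

```python
from fractions import Fraction


def transference_bounds(omega, n):
    """Return (lower, upper) bounds for sigma(y) given omega(y) = omega >= n."""
    omega = Fraction(omega)
    if omega < n:
        raise ValueError("omega(y) is never smaller than n")
    lower = 1 / (n - 1 + n / omega)
    upper = (omega - n + 1) / n
    return lower, upper
```

At $\omega = n$ both bounds equal $1/n$, which is the extremal case; for any larger $\omega$ the interval has positive length, so $\omega(\mathbf y)$ alone does not pin down $\sigma(\mathbf y)$.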

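On the other hand, the methods of this paper can be adapted to computations of the 
'simultaneous Diophantine exponents' of manifolds and measures. Indeed, given a column 
vector $\mathbf y \in \mathbb R^n$ one simply needs to work with the collection of vectors of the form $\begin{pmatrix} q\mathbf y + \mathbf p \\ q \end{pmatrix}$, 
where $\mathbf p \in \mathbb Z^n$ and $q \in \mathbb Z$, which is the same as $u_{\mathbf y}\mathbb Z^{n+1}$, where one uses $u_{\mathbf y} \overset{\mathrm{def}}{=} \begin{pmatrix} I_n & \mathbf y \\ 0 & 1 \end{pmatrix}$ 
instead of (3.1). Then, to study the simultaneous approximation properties of $\mathbf y$, one uses 
the action by 

\[ g_t = \operatorname{diag}(e^{t/n}, \dots, e^{t/n}, e^{-t}), \]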

which miiformly expands the first n coordinates of vectors in R"+^ and contracts the last 
one. This way an application of Theorem 2.2 can yield Theorem 0.3 with lu replaced by a, 
as well as, after a multi-parameter modification, by its multiplicative analogue. 
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6.5. Studying Diophantine exponents of matrices is a special case of a more general prob- 
lem, where one replaces the right hand side of (0.5) by an arbitrary function of ||q||. Let 
us specialize to the case of row vectors and use the following definition [K2, §6.3]: for a 
nonincreasing function : N ^ (0, oo), let stand for the set of y S R" for which there 
are infinitely many q G Z" such that 



By Groshev's Theorem (see e.g. [Sc2]) almost no (resp., almost all) y e K" belong to if 
the series 



converges (resp., diverges). More generally, in [BD] submanifolds $\mathcal{M}$ of $\mathbb{R}^n$ are called of
Groshev type for convergence (resp., divergence) if the convergence (resp., divergence) of
(6.5) implies that almost no (resp., almost all) points of $\mathcal{M}$ are in $\mathcal{W}_\psi$. It is known [BKM, Be,
BBKM] that nondegenerate submanifolds $\mathcal{M}$ of $\mathbb{R}^n$ are of Groshev type for both convergence
and divergence.
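
To see how the convergence condition recovers the critical exponent, consider the power functions $\psi_v(k) = k^{-v}$ with $v > 0$. The notation $\psi_v$ is introduced here only for illustration, and the series is taken in the form $\sum_k k^{n-1}\psi(k)$, the standard one in Groshev's Theorem for a single linear form in $n$ variables:

```latex
\[
  \sum_{k=1}^{\infty} k^{n-1}\psi_v(k) \;=\; \sum_{k=1}^{\infty} k^{\,n-1-v}
  \quad
  \begin{cases}
    \text{converges} & \text{if } v > n \quad (\text{since } n - 1 - v < -1),\\[2pt]
    \text{diverges}  & \text{if } v \le n.
  \end{cases}
\]
% Hence, by Groshev's Theorem, almost no \mathbf{y} \in \mathbb{R}^n lie in
% \mathcal{W}_{\psi_v} when v > n, and almost all do when v \le n.
```

Under these assumptions the threshold $v = n$ is exactly the critical value of the exponent $\omega$ for almost every $\mathbf{y} \in \mathbb{R}^n$.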

The situation is much less understood when $\mathcal{M}$ as above is replaced by a proper affine
subspace $\mathcal{L}$ of $\mathbb{R}^n$. Clearly $\omega(\mathcal{L})$ must be equal to $n$ in order for $\mathcal{L}$ to be of Groshev type
for convergence (since the choice $\psi(k) = k^{-v}$ for $v > n$ makes the series (6.5) converge), but
the converse is not likely to be true. On the other hand, in the following two cases it has
been proved that $\mathcal{L}$ parametrized as in (0.6) is of Groshev type for convergence under the
assumption that $\omega(A)$ is strictly less than $n$: when $\mathcal{L}$ is a line passing through the origin
[BBDD], and when it is of codimension one [G1]. In the former case $\mathcal{L}$ was also proved to
be of Groshev type for divergence. Note that both cases fall into the framework of Theorem
0.4, that is, admit a simple formula (0.7=) for the Diophantine exponent of $\mathcal{L}$. In view of
Corollary 5.2 it seems natural to make the following

Conjecture. Let $\mathcal{L}$ be parametrized by (0.6), and suppose that $\omega_j(A)$ is strictly less than $n$
for every $j$. Then $\mathcal{L}$ is of Groshev type for both convergence and divergence.

The author expects this conjecture, as well as the multiplicative analogue of its convergence
case, to be provable by a combination of the methods of [BKM, BBKM, G1, G3] and
the present paper.

More generally, for any submanifold of $\mathbb{R}^n$ it should be possible to state its own version of
Groshev's Theorem, with the convergence/divergence of (6.5) replaced by another 'dividing 
line' condition. The following problems, posed in [K2], still remain wide open: 

• is it true that the aforementioned 'dividing line' condition of an affine subspace $\mathcal{L}$ of
$\mathbb{R}^n$ is always inherited by manifolds nondegenerate in $\mathcal{L}$?

• given a general affine subspace, say parametrized as in (0.6), find its 'dividing line'
condition, say in terms of the Diophantine properties of the parametrizing matrix
$A$; or, vice versa, describe the class of subspaces with a given 'dividing line'.

6.6. Finally, we remark that the generality of Theorem 2.2 allows applications far beyond
Diophantine approximation over $\mathbb{R}$. Namely, one can similarly consider Diophantine properties
of measures on vector spaces over non-Archimedean local fields, both of characteristic
zero [KT] and of positive characteristic [G2]. In fact, in the aforementioned two papers it
was proved that manifolds nondegenerate in the ambient spaces are extremal (and moreover 
strongly extremal). An application of Theorem 2.2 can extend these results to manifolds 
nondegenerate in proper affine subspaces. 


